Introduction

A central goal in neuroscience is to determine how neural responses depend on external stimulus variables and the internal states of the brain. The dependence of an individual neuron's firing rate on a stimulus variable is often described by the tuning curve, i.e., the average firing rate of the neuron as a function of the stimulus1,2,3,4. Because tuning curves are the consequence of various internal computations in neural circuits, it is likely, and indeed empirically the case, that they are modulated by factors other than the stimulus variable selected a priori5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21. Variability of neural tuning has now been widely reported across neural systems, and has been proposed to take various forms, including multiplicative gain6,18,22,23,24,25,26,27, additive modulation5,28,29,30,31,32, shifts of tuning peaks33,34,35, and changes in tuning width36. These observations reflect the role of various factors, whether related to stimuli (e.g., stimulus contrast37, stimulus history34,38), behavior (e.g., movement16), or latent brain states25,28.

Tuning variability has been widely studied both functionally, i.e., its implications for information encoding and behavioral performance15,19,28,30,39,40, and mechanistically, i.e., how it is generated41,42,43. Prior studies have attempted to quantitatively model tuning variability in the sensory cortex, in particular orientation tuning in the primary visual cortex (V1), which is widely considered a paradigmatic case for studying the neural code. Decades of studies in V1 have yielded general insights into how cortical neurons encode external sensory variables. Perhaps surprisingly, studies of V1 tuning variability have so far yielded seemingly contradictory results. One line of work25,26 has proposed a simple multiplicative gain model to account for the tuning variability. Multiplicative gain has been postulated to play a vital role in encoding contrast44, encoding uncertainty27, facilitating downstream readout45, implementing attention6,7, and transforming coordinate systems (e.g., retina- to body-centered) in parietal cortex46,47. Mechanistic models suggest that multiplicative gain could result from threshold-linear neurons operating in the presence of intrinsic intracellular noise41,42,43. In contrast to the above work, other studies have suggested additive interactions5,18,48 or both additive modulation and multiplicative gain28,30,31 in V1.

Crucially, the analysis methods in the majority of this prior work presumed relatively restrictive structure for tuning variability (e.g., refs. 18,25,26,28,30,32), leaving open the question of whether other forms of fluctuations might in fact account for the data better. Furthermore, existing analyses generally relied on trial-averaging and comparison across conditions6,30,38, thus failing to capture the moment-to-moment variability in tuning. Addressing these open issues requires approaches that can infer the structure of tuning fluctuations directly on single-trial data—and ideally on the raw spike train itself—while also avoiding restrictive assumptions.

Here we introduce an unsupervised statistical technique, Poisson functional PCA (Pf-PCA), to identify the structure of latent tuning fluctuations directly from neural spiking data. Importantly, we apply this method to address tuning variability in a classic neural system that has long been characterized via tuning, namely neurons in V1. Because Pf-PCA yields a generative model of the moment-to-moment tuning variability, where a moment is defined by a block consisting of responses to all stimuli, it can be used to analyze information encoding through information-theoretic measures such as Fisher information. It can also be used to analyze the geometrical structure of the neural manifold. Performing these analyses, we find that Pf-PCA reveals several new insights into the structure, information content, and geometry of the V1 orientation code. The proposed analysis framework is broadly applicable to other low-dimensional tuning modalities.

Results

Previous studies have suggested that tuning fluctuations may be heterogeneous30,38. This heterogeneity motivated us to develop a flexible, unsupervised analysis framework to understand tuning variability. It may be used to analyze the tuning fluctuations of any one-dimensional variable with smooth tuning properties. We will first develop and validate our method, and then apply it to analyze macaque V1 data. We will show that our method helps reveal insights into the structure of the neural code for visual orientation, its information content, and the geometry of the representation.

The Poisson functional PCA framework

Figure 1a illustrates the basic modeling framework of Pf-PCA (see “Methods” for details). The model assumes that the logarithm of the tuning curves (of an arbitrary stimulus variable) is determined by a smooth mean component and smooth functional principal components (fPCs) weighted by the amount of latent fluctuations. Note that each fPC is a function that is tuned to the stimulus variable. The fPCs and their weights (i.e., scores) together capture the fluctuations of the tuning curves. Quantitatively, the tuning curve μt for the t-th moment can be described as

$$\log ({\boldsymbol{\mu }}_{t})={\boldsymbol{f}}+\mathop{\sum}\limits_{k}{\alpha }_{k,t}{\boldsymbol{\phi }}_{k}+{\boldsymbol{\epsilon }}_{t}.$$
(1)

Here f is the mean component, ϕk is the k-th fPC, and αk,t denotes the amount of fluctuation (i.e., score) for the k-th component during the t-th moment; αk,t is assumed to follow a zero-mean Gaussian distribution. The last term ϵt is assumed to be zero-mean Gaussian noise that captures the residual unstructured fluctuations. The spike train at every moment is assumed to be generated from a Poisson process with the firing rate specified by Eqn. (1). Note that with only the first term f, this model is equivalent to the standard tuning curve model of spike counts. The remaining terms capture the additional variance contributed by moment-to-moment fluctuations, making the model naturally capture the over-dispersion of spike counts25,32.
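To make the generative process concrete, the following sketch simulates spike counts from Eqn. (1) for a single neuron. The stimulus grid, the bell-shaped mean component, the single flat fPC, and all variance values are illustrative assumptions, not fitted quantities.

```python
import numpy as np

rng = np.random.default_rng(0)

# Stimulus grid (e.g., orientations in degrees) -- illustrative choices.
s = np.linspace(-90, 90, 9)
B = 50                                    # number of moments (blocks)
dt = 1.0                                  # counting window (s)

# Mean component f(s): log of a bell-shaped tuning curve with a baseline.
f = np.log(0.5 + 5.0 * np.exp(-0.5 * (s / 20.0) ** 2))

# One illustrative fPC: a flat component corresponds to multiplicative gain.
phi1 = np.ones_like(s) / np.sqrt(len(s))  # unit-norm, constant over s
sigma1, sigma0 = 0.4, 0.05                # latent and residual std (assumed)

# Generative model, Eqn. (1): log(mu_t) = f + alpha_{1,t} * phi1 + eps_t
alpha = rng.normal(0.0, sigma1, size=B)             # scores, one per moment
eps = rng.normal(0.0, sigma0, size=(B, len(s)))     # unstructured residual
log_mu = f[None, :] + alpha[:, None] * phi1[None, :] + eps

# Poisson spiking given the moment-specific tuning curve.
counts = rng.poisson(np.exp(log_mu) * dt)           # shape (B, number of stimuli)
```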

Fig. 1: The Poisson functional PCA (Pf-PCA) framework.
figure 1

a Illustration of the Pf-PCA framework. Tuning functions (i.e., the logarithm of the tuning curves) are modeled as a sum of the mean and functional principal components (fPCs) weighted by the amount of latent fluctuations α (i.e., scores). Tuning functions then pass through a static non-linearity to obtain the tuning curves. The non-linearity is assumed to be an exponential function in this study. Finally, standard Poisson spiking noise is assumed for the generation of the observed spike trains. The Pf-PCA algorithm performs inference on the observed spike counts. It extracts the mean component, the fPCs, and the moment-to-moment scores of each fPC. b, c Schematics illustrating this framework using two special cases: multiplicative gain and tuning curve shift. b For multiplicative gain, the logarithm of individual tuning curves (illustrated using Gaussian functions plus a baseline) can be decomposed into a mean component and a flat functional principal component. Different tuning curves correspond to different scores. c For the case of tuning curve shift, the logarithm of the individual tuning curves can be approximated by the summation of the mean component, the first functional PC, which has an anti-symmetric shape, and a small residual.

Our algorithm takes spike count data as the input and infers the mean, the fPCs, the variance of each component, as well as the weight of each fPC for each moment. Critically, the shape of the fPCs, which specifies the particular form of the fluctuations, is directly inferred from the data. When studying neural responses to continuous stimulus variables, it is natural to assume that the mean component and fPCs of individual neurons are smooth functions of the stimulus. Importantly, our method merely assumes that the mean component and fPCs are smooth, without imposing restrictive assumptions on their shapes. Our method provides a way to parse the variability of tuning into a set of fPCs from the spike counts. Now consider a couple of special cases. With the additional assumptions that there is only one fPC and that it is constant over the stimulus dimension (Fig. 1b), our model becomes essentially the multiplicative gain model25. When tuning curves exhibit systematic lateral shifts, our model can capture them with an fPC that is proportional to the derivative of the mean component (Fig. 1c). It is worth emphasizing that our method is general and capable of capturing other, potentially more complicated, cases. It may be used to analyze the tuning fluctuations of any one-dimensional stimulus variable with smooth tuning properties. The two special cases are sketched below.
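A short sketch makes the two special cases explicit; the mean component and score values below are hypothetical choices for illustration only.

```python
import numpy as np

s = np.linspace(-90, 90, 181)
f = np.log(0.5 + 5.0 * np.exp(-0.5 * (s / 20.0) ** 2))    # mean component

# Case 1 (Fig. 1b): a constant fPC. Since exp(f + a*phi) = exp(a*phi)*exp(f)
# when phi is flat, the score a acts as a pure multiplicative gain.
phi_gain = np.ones_like(s)

# Case 2 (Fig. 1c): an fPC proportional to the derivative of the mean.
# To first order, f(s - a) ~ f(s) - a * f'(s), so the score acts as a
# lateral shift of the tuning curve.
phi_shift = np.gradient(f, s)

scores = (-0.3, 0.0, 0.3)                                  # illustrative values
gain_family = [np.exp(f + a * phi_gain) for a in scores]    # scaled copies
shift_family = [np.exp(f - a * phi_shift) for a in scores]  # ~shifted copies
```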

Our method is developed by adapting functional PCA49,50,51,52,53 to deal with Poisson spiking noise. Because the firing rate is not observed, the inference procedure is more challenging. We resolve this problem by developing a procedure based on an Expectation-Maximization (EM) algorithm. Details of the inference procedure are described in the Methods section. In broad strokes, the algorithm treats the unobservable firing rates μt(s) as “observations" generated from the model in Eqn. (1). To maximize the likelihood, the algorithm iterates between estimating the mean and covariance parameters and calculating the posteriors of the firing rates given the spike counts and the current parameter estimates via a Monte Carlo method. This step gives an estimate of the firing rates. In the next step, we apply the functional PCA technique to the estimated firing rates to estimate the components and the moment-to-moment fluctuations.

Validation of the method

We validated our method systematically using simulated data. Inspired by previous experimental observations on tuning variability22,23,25,28,30,33,34,35,36, we first examined whether our method is able to recover fPCs that correspond to multiplicative gain, additive change, tuning shift, or sharpening. Specifically, we generated synthetic data exhibiting different types of tuning fluctuations by reverse-engineering the appropriate fPC, and tested Pf-PCA and alternative methods on these data, where the ground truth was known.

Figure 2a shows results based on the analysis of the simulated datasets using our method and alternative methods (see “Methods” for details). We found that Pf-PCA could accurately recover the form of the fluctuations in all four cases. Furthermore, it approximately recovers the proportions of variance explained by the structured fluctuation (Fig. 2a), as well as the magnitude of the latent fluctuation on a moment-by-moment basis.

Fig. 2: Model validation with bell-shape tuning curves.
figure 2

a Inferred fluctuations for four types of tuning fluctuations. Black curve: the tuning curve with zero fluctuation. This corresponds to the mean component exponentiated. Red/green curves: the corresponding tuning curves when setting the score to ±1 standard deviation of the scores. Crosses in the first column: the set of discrete stimuli used in the simulation. From left to right: ground truth, Pf-PCA, regular PCA, and a reduced version of Pf-PCA termed μ-PCA. In each case, we simulated neural responses from the corresponding ground truth model, and then inferred the form of the fluctuation and its magnitude for each moment. The percentages on the plots show the proportion of variance explained by the component. By our construction of the model, the first fPC of a “perfect” estimation procedure should explain 80% of the variance. b–d Recovered scores vs. true scores for each method. Light to dark bisque points denote multiplicative gain (“M”), additive modulation (“A”), tuning shift (“T”), and tuning sharpening (“W”). Dashed lines indicate the diagonal. Overall, Pf-PCA substantially outperforms the alternatives in all these cases, both in terms of recovering the form of the tuning fluctuations and the magnitude at each moment. The correlations between the recovered and the ground-truth scores are 0.788 for Pf-PCA (b), 0.544 for regular PCA (c), and 0.647 for μ-PCA (d).

How does our method compare to simpler methods? Applying conventional PCA to the synthetic data, we found that it often misidentified the form of the fluctuation, and that it could not reliably estimate the magnitude of the latent fluctuations (Fig. 2c). We also applied a variant of our method, referred to as μ-PCA, obtained by removing the smoothness constraint in our full algorithm (see “Methods” for details). This algorithm is similar to Poisson PCA54 (a discussion of the technical differences between μ-PCA and Poisson PCA can be found in “Methods”). μ-PCA generally performs better than regular PCA, but is still considerably worse than the full method, Pf-PCA.

We further validated our method when multiple types of fluctuations co-exist, e.g., a combination of multiplicative gain and tuning shift. We found that Pf-PCA could recover both components reliably (see Supplementary Fig. 3), and that it drastically outperforms regular PCA and μ-PCA (see Supplementary Fig. 4). In addition, we validated our method in the case of monotonic tuning curves, e.g., sigmoidal tuning curves, and found similar results (see Supplementary Fig. 5). Taken together, these results on synthetic data suggest that our method can robustly recover the structure and magnitude of tuning fluctuations using an experimentally realistic amount of data.

Pf-PCA reveals power-law modulation of neural tuning

We next show that our method can be used to reveal scientific insights into neural codes. We focus on the variability of orientation tuning in macaque V1, which has been a question of substantial interest over the past decades and may have general implications for the principles of neural coding in the cortex. Previous studies have mainly focused on “gain variability”, which assumes a constant additive modulation or a multiplicative gain that scales the whole tuning curve. The nature of this tuning variability has been heavily debated to date. Our unsupervised approach enables us to generalize the notion of “gain variability” to general “tuning variability”, resulting in a more accurate understanding of the structure of the neural response.

We analyzed seven previously published datasets, each with dozens of neurons simultaneously recorded from macaque V130,55 (total number of neurons = 402). During these experiments30,55, drifting gratings with different directions were presented, each for 1 or 1.28 seconds. A block-randomized design was used, with each block sampling a pre-determined set of stimulus directions once. See “Methods” for details. To build some intuition about orientation tuning variability, we first split the blocks into two halves according to the number of spikes of individual neurons, and calculated the tuning curves for the high and low conditions30. Figure 3a shows six representative example neurons. Visual inspection suggests that tuning variability is heterogeneous across neurons, exhibiting features consistent with additive modulation, multiplicative gain, or both, and sometimes neither.

Fig. 3: Results of Pf-PCA on V1 data.
figure 3

a Recovered mean component and first fPC for six example neurons. Neuronal tuning fluctuations exhibit a variety of structures. The top panels show the average tuning curves obtained by splitting the experimental blocks into high and low blocks. Panels in the second row show the tuning curves and the fluctuations inferred from Pf-PCA, plotted in the firing rate space. The last two rows show the mean component and the first fPC, both plotted on the logarithm of the firing rate scale. For the last row, “PV” (y-axis) represents the value of the first fPC for each stimulus. b The variance explained by the first fPC. Central mark: median value. Edges of the box: 25th and 75th percentiles. Whiskers: the maxima and minima of the data within 1.5 times the interquartile range from the nearest quartile. c Testing the significance of the regression analysis of the first fPC on the mean component. The majority of the neurons show a significant linear relationship between the mean component and the first fPC. d The “fraction” index quantifying how much information of ϕ1(s) is accounted for by a linear function of f(s) (see “Methods”). That the “fraction” index is close to 1 for each dataset suggests that a linear function of the mean component explains the first fPC well for the majority of V1 neurons. Box plot: same convention as (b). e Slope values of the regression analysis obtained from real data vs. those obtained from synthetic data. The synthetic data were generated from a multiplicative gain modulation model.

We applied Pf-PCA to analyze the tuning fluctuations for stimulus orientation. We treated each block of stimuli as one moment, assuming that the tuning curve is stable within each block. Thus the tuning fluctuations studied here are at a timescale of ~10 s. The Pf-PCA model achieves a better fit than the Modulated Poisson model, which assumes a multiplicative gain25, as assessed through cross-validated prediction error and cross-validated likelihood (see Supplementary Note 1, Supplementary Fig. 1). When applying Pf-PCA, we assumed three fPCs, which are sufficient to capture most of the tuning variability in these data (see Supplementary Fig. 6). In fact, the first fPC alone captures 62.4% of the variance on average (Fig. 3b). Below we focus our analysis primarily on the first fPC.

As mentioned before, if a neuron exhibits a multiplicative gain change, the first fPC should be a constant (Fig. 1b). However, we found that the first fPC for the majority of neurons is not constant (for example, see Fig. 3a). This implies that the fluctuations of the firing rate of these neurons cannot be accurately described as a pure multiplicative gain; instead, the gain appears to be stimulus-dependent. Interestingly, the first fPC for most neurons is highly correlated with the mean component. This is confirmed by a simple linear regression analysis between the mean component and the first fPC (Fig. 3c). For quantification, we defined a fraction to capture the percentage of the first fPC explained by the simple linear relationship (see “Methods” for details), and found that this linear relationship explains most of the information in the first fPC (Fig. 3d). We wondered if our estimation procedure might exhibit systematic biases, such that even when the ground truth model was a simple gain modulation model, the estimated first fPC might nonetheless be correlated with the mean component. We performed a control analysis and found that this is unlikely. Specifically, we simulated datasets from a multiplicative gain model with approximately matched statistics and performed Pf-PCA on the synthetic datasets (see “Methods” for details). The results showed that the slope values of the regression for the synthetic data are much closer to 0 than those obtained from the V1 data (Fig. 3e).

Crucially, the above observation (linear relationship between the mean tuning curve and the first fPC) has conceptually important implications for the tuning structure. In particular, the linear relationship permits the following linear approximation for the first fPC ϕ1(s),

$${\phi }_{1}(s)\,\approx \,b+wf(s).$$
(2)

Substituting Eqn. (2) into Eqn. (1) and neglecting the residual terms gives \(\log {\mu }_{t}(s)\approx f(s)+{\alpha }_{1,t}\{b+wf(s)\}=b{\alpha }_{1,t}+(1+w{\alpha }_{1,t})f(s)\). Exponentiating both sides, the tuning curve for moment t can be expressed as

$${\mu }_{t}(s)\, \approx \,\exp (b{\alpha }_{1,t}){[\exp \{f(s)\}]}^{1+w{\alpha }_{1,t}}.$$

Because the latent variable α1,t appears in the exponent, the fluctuations of the tuning curves can in fact be described as a power-law modulation, with the exponent of the power function varying from moment to moment.

Power-law modulation accounts for both additive modulation and multiplicative gain

Previous studies have proposed two forms of gain change in V125,28,29,30, i.e., additive and multiplicative. It has been heavily debated which type of variability is more appropriate for describing V1 activity, and whether both types co-exist in V1. We hypothesize that part of the controversy is due to the restrictive notion of gain variability in previous studies. By considering and analyzing general tuning variability, as enabled by Pf-PCA, we demonstrate below that the power-law relation unifies these different forms of gain variability.

Noting that \({\mu }_{0}(s)=\exp \{f(s)\}\) (and assuming that it is normalized to have peak activity equal to 1, with the normalization absorbed into the intercept term b), the tuning curve at each moment can be re-expressed as

$${\mu }_{t}(s)={\mu }_{0}{(s)}^{1+w{\alpha }_{1,t}}\exp (b{\alpha }_{1,t}).$$
(3)

Equation (3) is a power function with power 1 + wα1,t and (log-)scale bα1,t, both of which are linear functions of the fluctuation α1,t for moment t. In this relation, there are two free parameters for each neuron, corresponding to the slope and intercept of the regression analysis, respectively. Without loss of generality, we can constrain the intercept to be non-negative. The consequence of varying each parameter on the tuning is straightforward to see (a sketch of the two effects follows below). Specifically, a non-zero intercept leads to fluctuation of the peak firing rate, while a non-zero slope leads to systematic tuning width changes due to the exponentiation (Fig. 4a). Depending on the specific combination of the slope (w) and intercept (b), the tuning fluctuation will exhibit different characteristics for individual neurons.
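A minimal sketch of Eq. (3), assuming a normalized Gaussian-shaped μ0 and hypothetical (w, b) values, makes the two effects concrete:

```python
import numpy as np

s = np.linspace(-90, 90, 181)
mu0 = np.exp(-0.5 * (s / 20.0) ** 2)        # normalized: peak activity = 1

def power_law_tuning(alpha, w, b):
    """Tuning curve under Eq. (3): mu0^(1 + w*alpha) * exp(b*alpha)."""
    return mu0 ** (1.0 + w * alpha) * np.exp(b * alpha)

# Hypothetical (w, b) pairs: w = 0 gives a pure multiplicative gain;
# a non-zero w changes the tuning width via the exponent.
for w, b in [(0.0, 0.3), (0.2, 0.3), (-0.1, 0.3)]:
    mu_hi = power_law_tuning(+1.0, w, b)    # score at +1 SD
    mu_lo = power_law_tuning(-1.0, w, b)    # score at -1 SD
```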

Fig. 4: Power-law modulation accounts for both additive and multiplicative modulations.
figure 4

a The “flatness” index as a function of the slope and intercept parameters of the power-law relation. The “flatness” index quantifies the extent to which the firing rate change depends on the stimulus orientation. In certain parameter regimes it manifests itself as multiplicative or additive modulation, among others. Multiplicative gain leads to a zero “flatness” index, while additive change implies a “flatness” equal to 1. Changing the flatness from 0 to 1 leads to a gradual transition from multiplicative gain to additive change. This index can be smaller than 0, in which case the tuning curve sharpens. b The estimated values of slope and intercept for individual neurons. c The histogram of the flatness scores, indicating that tuning fluctuations lie on a continuum.

First, when the slope w = 0, the power-law modulation degenerates to a pure multiplicative gain25. Second, the power-law modulation can lead to approximately additive modulation for certain combinations of the slope (w) and intercept (b). For quantification, we defined a “flatness” index to characterize the change over the stimulus variable induced by the fluctuation. Informally, this index computes the ratio between the firing-rate change at the orientation orthogonal to the preferred one and the change at the preferred orientation (see “Methods” for a formal definition). With additive modulation, the flatness index is 1, while multiplicative gain leads to a flatness index of 0. When the flatness score is negative, the resulting configurations show a sharpening of the tuning curve. Figure 4a shows the “flatness” while systematically varying the two parameters (i.e., the slope and intercept). In the appropriate parameter regimes, the power-law manifests itself as a multiplicative or an additive change (see “Methods” for details), while parameter values in between result in tuning modulation that might be interpreted as a mix of multiplicative and additive modulations28.
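The following sketch implements one reading of the informal definition above (the formal definition is given in “Methods”); the tuning curve is an illustrative assumption.

```python
import numpy as np

s = np.linspace(-90, 90, 181)
mu0 = 0.1 + np.exp(-0.5 * (s / 20.0) ** 2)   # illustrative tuning curve

def flatness_index(mu_hi, mu_lo):
    """Firing-rate change at the least-responsive (orthogonal) orientation
    divided by the change at the preferred orientation. Additive modulation
    gives 1, multiplicative gain gives ~0, negative values mean sharpening."""
    i_pref = np.argmax(mu_lo)                # preferred orientation (peak)
    i_orth = np.argmin(mu_lo)                # orthogonal orientation (trough)
    return (mu_hi[i_orth] - mu_lo[i_orth]) / (mu_hi[i_pref] - mu_lo[i_pref])

print(flatness_index(mu0 + 0.2, mu0))        # additive change   -> 1.0
print(flatness_index(1.5 * mu0, mu0))        # multiplicative    -> ~0.09
```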

Empirically, most of the neurons lie between multiplicative and additive changes (Fig. 4b), and are thus better characterized by the proposed power-law relation than by a pure multiplicative or additive modulation. A control analysis on simulated data generated with multiplicative gain showed that, as expected, the recovered slopes are close to zero across intercept values (see Supplementary Fig. 9). Note that a previous study30 found that additive and multiplicative fluctuations were anti-correlated, which is naturally explained by our power-law model. It is also worth mentioning that a subset of neurons exhibits a mild sharpening of the tuning curve. Together, these results provide a unified account of the fluctuations of orientation tuning in V1. Although we cannot rule out the possibility of two separate mechanisms (one for multiplicative gain, and one for additive modulation), our results show that a single form of fluctuation is sufficient to capture the variability, and that the tuning fluctuations of individual neurons appear to lie on a continuum.

Population tuning fluctuations are low-dimensional

The dimensionality of tuning fluctuations has important implications for the mechanisms and the function of the circuit. Some studies (e.g., refs. 28,29) implicitly assumed a rank-1 fluctuation that scales the gain of the population in a coherent manner, and found evidence suggesting that the total population activity is highly predictive of the moment-to-moment fluctuation of the responses of individual neurons29. Others found the coupling strength of individual neurons to the rest of the local network to be diverse56, implying a higher dimensionality of the tuning fluctuations and a potential role of recurrent connections in shaping network responses. Recently proposed E/I-balanced network models with spatial connectivity structure57 predict that the population fluctuations should be low-dimensional. Finally, it has been proposed27 that gain variability in V1 may serve to represent stimulus uncertainty via sampling, a computation that would generally require the gain variability to be high-dimensional.

We examined the structure of the tuning fluctuations at the population level. As demonstrated earlier, for each neuron, the tuning fluctuations are well captured by the first fPC. Exploiting this observation, we approximated the tuning fluctuations of a neural population by concatenating the scores of individual neurons into a matrix (number of neurons × number of blocks, Fig. 5a). Examining the correlation of the scores, we found that while most of the neurons fluctuate coherently, a small group of neurons is anti-correlated with the rest (Fig. 5b) in some sessions (for results for all sessions, see Supplementary Fig. 10). What is the dimensionality of the latent fluctuations of the neural population? If the neurons shared a coherent multiplicative or additive change28, the latent fluctuation should be close to one-dimensional. To assess this, we performed standard PCA on the score matrix to quantify the linear dimensionality (sketched below). We found that, while the fluctuation shows a low-dimensional structure (Fig. 5c), the dimensionality exceeds one.
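The dimensionality assessment amounts to PCA on the neuron-by-block score matrix; a minimal sketch, with a random placeholder standing in for the inferred scores, is:

```python
import numpy as np

rng = np.random.default_rng(1)
A = rng.normal(size=(60, 50))     # placeholder for the (neuron x block)
                                  # matrix of first-fPC scores

# Linear dimensionality via PCA on the score matrix.
A_centered = A - A.mean(axis=1, keepdims=True)
_, svals, _ = np.linalg.svd(A_centered, full_matrices=False)
var_explained = svals ** 2 / np.sum(svals ** 2)

# A rank-1 coherent-gain population would put essentially all variance on
# the first PC; the V1 data instead spread it over a few leading PCs.
print(np.cumsum(var_explained)[:5])
```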

Fig. 5: Population structure of the tuning fluctuations: the fluctuations of V1 population as captured by the first fPC have a low dimensional structure.
figure 5

a The heat map of the scores of the first fPC for every neuron in one dataset (Session D5). Every data point represents the score of one neuron during one block of stimulus presentation. b The correlation matrix of the score matrix above. The neurons in (a) and (b) are sorted according to a hierarchical clustering algorithm. c The dimensionality of the fluctuations as captured by the first fPC. We ran standard PCA on the score matrix. The first two principal components capture more than 60% of the variance. d, e Similar to (a, b), but based on a simulated dataset assuming rank-1 multiplicative gain fluctuations. f Similar to (c), but based on the simulated dataset assuming rank-1 multiplicative gain fluctuations. In this case, the first principal component consistently dominates the other components in the recovered score matrix. These results suggest that the fluctuations in the neural population are low-dimensional, but not one-dimensional.

The empirically observed latent fluctuations cannot be explained by a rank-1 multiplicative or additive modulation model. To demonstrate this, we performed a control analysis by generating simulated data using the rank-1 additive or multiplicative model (see “Methods” for details), and found that the correlation structure of the scores inferred from the synthetic data is simpler (Fig. 5d, e, and Supplementary Figs. 11 and 12). The dimensionality of these scores is lower than that estimated from real data (Fig. 5f).

These results paint a more nuanced picture of the fluctuations of V1 at the neural population level. Deviating from what was suggested previously28, the fluctuations of V1 neurons in the anesthetized state are not completely coherent, with a subset of neurons fluctuating in the opposite direction from the majority; nor can they be characterized by rank-1 additive modulation or multiplicative gain fluctuations. These results will help further constrain and refine the network mechanisms giving rise to the tuning fluctuations in the visual cortex57,58.

Higher neural activity barely increases, or even decreases, Fisher information

So far, by applying our Pf-PCA analysis to the V1 data, we have derived a generative model of the neural activity in V1. Below, we demonstrate that this generative model is useful for characterizing several critical aspects of the neural code. In this section, we leverage Pf-PCA to calculate Fisher information in order to understand how the tuning fluctuations affect the information-carrying capacity of the V1 population. We focus on a local measure of the representation, i.e., Fisher information (FI), which has been important in quantifying the local properties of the neural code59,60,61,62. In the next section, we will use Pf-PCA to understand how the geometry of the neural response changes under tuning fluctuations, which represents another important aspect of the neural code. Overall, through the FI and geometry analyses, Pf-PCA enables a further understanding of the local and global structure of the V1 code.

First, using the model estimated by Pf-PCA, we examined the relationship between the FI and the magnitude of neural activity for individual neurons (Fig. 6a, b); see “Methods” for the calculation of the FI, and the sketch below for the basic logic. We found that this relationship differs substantially from neuron to neuron: it can be positive, negative, or flat (Fig. 6a, b). Figure 6b shows the histogram of the slopes obtained by regressing FI against neural activity. Interestingly, the median of these slopes is close to 0 (i.e., 0.001). We also report the histogram of the slopes of the FI-activity curve for each session in Supplementary Fig. 13. We further validated these results by performing a recovery analysis using synthetic data. We found that our method can indeed faithfully recover the relationship between FI and neural activity for individual neurons given the sample size of the data (see Supplementary Fig. 14).
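For a Poisson neuron, the FI at stimulus s is \(I(s)={\mu }^{\prime} {(s)}^{2}/\mu (s)\) (per unit counting window), so the FI-activity relation can be traced by sweeping the latent score. The sketch below uses an illustrative mean component and a hypothetical shift-like fPC, and summarizes the relation with a linear regression slope, one plausible choice of summary.

```python
import numpy as np

s = np.linspace(-90, 90, 181)
f = np.log(0.5 + 5.0 * np.exp(-0.5 * (s / 20.0) ** 2))  # mean component
phi1 = np.gradient(f, s)              # hypothetical shift-like first fPC

def fisher_info(alpha):
    """Poisson FI, I(s) = mu'(s)^2 / mu(s), for the tuning curve at latent
    state alpha (counting window set to 1 for simplicity)."""
    mu = np.exp(f + alpha * phi1)
    dmu = np.gradient(mu, s)
    return dmu ** 2 / mu

# Trace the FI-activity relation by sweeping the score, then summarize it
# with a linear regression slope (cf. the single-neuron slopes in Fig. 6b).
alphas = np.linspace(-1.0, 1.0, 21)
activity = np.array([np.exp(f + a * phi1).sum() for a in alphas])
total_fi = np.array([fisher_info(a).sum() for a in alphas])
slope = np.polyfit(activity, total_fi, 1)[0]
```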

Fig. 6: Fisher information analysis reveals that higher neural activity barely increases, or even decreases, Fisher information.
figure 6

a The relation between the FI and the neural activity (lower panels) and the tuning curves with latent fluctuations induced by the first fPC (upper panels) for four example neurons. b Histogram of the slopes of the FI-activity relation for all neurons (n = 402). c Scatter plots showing the relationship between the population FI and the neural activity for each session in the real data. d Scatter plots showing the relationship between the population FI and the neural activity for the synthetic data. The synthetic data were generated from the Pf-PCA model fit to each corresponding dataset. Note that there is a tendency for the model to underestimate the scores (as expected), leading to a proportional underestimation of the population FI. Importantly, this underestimation does not affect the recovery of the relation between the population FI and the neural activity.

Figure 6c shows the population FI obtained by summing the FI values across all stimuli within a block and all neurons in each dataset, sorted according to the neural activity of individual blocks. Here we assume that the neurons are noise-independent conditioned on the latent fluctuations. Note that a multiplicative gain model predicts that the population FI scales proportionally with the amount of neural activity, or, put another way, that doubling the firing rate would double the population FI. However, we found that, for most sessions, the population FI is minimally affected, or decreases systematically, as the neural activity increases. This is in sharp contrast with the multiplicative gain model. To quantify this, we defined an FI-modulation index (i.e., the slope of the FI-activity curve). Under the multiplicative gain model, this FI-modulation index is exactly 1. In the data, the modulation indices for all sessions are far smaller than 1 and in some cases negative (−0.17, −0.44, −0.07, 0.10, 0.12, 0.26, and 0.21, respectively). A recovery analysis based on synthetic data suggests that our procedure can indeed recover the relationship between the neural activity and the population FI (Fig. 6d). See “Methods” for details. These results are consistent with ref. 30 in that both studies found that increased neural activity does not lead to a substantial increase in the population FI. Meanwhile, we also noticed some subtle discrepancies between the two studies: ref. 30 suggested that there was minimal change in the population FI when neural activity changes. We believe that the difference lies in the analysis methods (see “Methods” and Supplementary Note 5 for a detailed discussion).

Changes of representational geometry induced by spontaneous fluctuations differ from those induced by contrast

Neural responses naturally form a neural manifold as the stimulus is systematically varied. The geometry of this encoding manifold has multiple implications for understanding the format of the representation and for linking neural responses to behavior (reviewed in ref. 63). How tuning variability affects the geometrical properties of the encoding manifold is an interesting yet unresolved question. To investigate this question, we began by simulating multiplicative gain, a simple scenario illustrating how fluctuating signals appear in the geometric analysis (sketched below). We constructed a homogeneous population code for stimulus orientation with independent Poisson noise, and varied the shared multiplicative gain (see “Methods” for details). This population coding model recapitulates the basic effect of varying stimulus contrast22,23,44. We computed the representational distance64 as a function of the orientation disparity, and found that multiplicative gain only scales the representational distance function without changing its shape (Fig. 7c). A 3-D multi-dimensional scaling (MDS) embedding based on the representational distance matrix shows that the neural manifold under tuning fluctuation exhibits a cone shape (Fig. 7d), with the radial dimension encoding the multiplicative gain. When projecting onto the first two dimensions, we observed that the size of the representation for each contrast (e.g., the radius of each circle) scales with the neural activity (Fig. 7e).
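A sketch of this control analysis, assuming the representational distance is the Euclidean distance between mean population rate vectors (a common choice, though not necessarily the exact metric of ref. 64), with all population parameters illustrative:

```python
import numpy as np
from scipy.spatial.distance import cdist
from sklearn.manifold import MDS

# Homogeneous population: N neurons with shifted copies of one orientation
# tuning curve (von Mises-like; all parameters illustrative).
N, n_theta = 64, 180
theta = np.linspace(0, np.pi, n_theta, endpoint=False)
prefs = np.linspace(0, np.pi, N, endpoint=False)
tuning = 1 + 10 * np.exp(3 * (np.cos(2 * (theta[None, :] - prefs[:, None])) - 1))

# States = (orientation, gain level); a shared multiplicative gain mimics
# the effect of stimulus contrast.
gains = [0.25, 0.5, 0.75, 1.0]
states = np.concatenate([g * tuning for g in gains], axis=1).T  # (720, N)

# Representational distance matrix between all pairs of states, followed
# by a 3-D MDS embedding of it (cone-like for pure multiplicative gain).
rd = cdist(states, states)
emb = MDS(n_components=3, dissimilarity="precomputed").fit_transform(rd)
```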

Fig. 7: Geometry analysis demonstrates that the latent tuning fluctuations lie on a different manifold compared to changing contrast.
figure 7

a–e Analysis based on the multiplicative gain model. a Tuning curves under different multiplicative gain (or contrast) levels. b The representational distance matrix for each pair of states, defined by the orientation and the gain level. We discretized the orientation into 180 bins, resulting in 180 × 4 = 720 states. The states were arranged according to the orientation and the gain level. c The representational distance (RD) as a function of orientation disparity for four different gain levels. d 3-D MDS revealing a cone-like structure of the neural population code. e Projection onto the first two dimensions revealing that the size of the representation (measured by radius) scales proportionally with the neural activity. f The pipeline of the geometric analysis based on real data. For each session, we inferred the score matrix for the first fPC, and then computed ten characteristic states using a clustering procedure. For each characteristic state, we generated the corresponding tuning curves using the results from Pf-PCA. The curves for different states are color-coded based on the total activity (normalized to the maximum). g–m The results of the geometric analysis for each dataset. The MDS results reveal a cylinder-like structure in most of the sessions. Although the total activity changes substantially across the different states, the size of the representation does not. This is apparent when comparing it to the multiplicative gain model.

Next, we sought to understand how the tuning fluctuations identified by Pf-PCA from the V1 data affect the geometry of the code65,66. To do so, we created a neural population code based on the empirically fitted tuning curves and scores from Pf-PCA (see “Methods” for details). We clustered the score matrix into 10 clusters, then computed the average scores for each cluster to obtain the pattern of fluctuations corresponding to each of the 10 characteristic states (a sketch of this step follows below). For each state, the corresponding tuning curves were generated accordingly. Analyzing the representational distance (RD) as a function of orientation disparity for the 10 latent states (shown in Fig. 7g–m, first column), we found that the RD curves are only slightly affected by the total activity. In several cases, higher activity leads to overall lower RD (e.g., Fig. 7h). Furthermore, the MDS analysis (Fig. 7g–m, second column) shows that the fluctuations cause the representation to move along a cylinder-like manifold. When projecting onto the first two dimensions, we made two observations. First, the centers of the representations corresponding to all states are aligned, suggesting that the representation “drifts”67 in a direction orthogonal to the representation of orientation. Second, the size of the representation changes only slightly with varying population activity (Fig. 7g–m, third column). Note that this general pattern does not resemble the cone-like structure induced by the multiplicative gain (Fig. 7d). In two of the seven sessions, the latent fluctuations are smaller, so the cylinder structure does not appear in the 3-D MDS; however, it becomes apparent when plotting the first two and the fifth dimensions of a 5-D MDS embedding.
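A minimal sketch of the clustering step, with a random placeholder standing in for the inferred score matrix and k-means as an assumed clustering choice (the paper does not specify the algorithm):

```python
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(4)
A = rng.normal(size=(60, 50))       # placeholder (neuron x block) scores

# Cluster the blocks into 10 characteristic latent states and average the
# scores within each cluster; each state then yields one set of tuning
# curves via Eq. (1) with the scores fixed at the cluster average.
km = KMeans(n_clusters=10, n_init=10, random_state=0).fit(A.T)
state_scores = np.stack([A.T[km.labels_ == k].mean(axis=0)
                         for k in range(10)])   # (10 states, 60 neurons)
```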

These results demonstrate that the spontaneous fluctuations of neural tuning in V1 lie on a different manifold from that induced by changing stimulus contrast, which leads to a multiplicative gain. Note that the effect is also different from a simple additive modulation (Supplementary Fig. 16). These results have important implications for the downstream readout. If the spontaneous fluctuations lay on the same manifold as changing stimulus contrast, downstream areas would not be able to distinguish the spontaneous fluctuations from a change of contrast. Our results argue against that scenario, and further suggest that the latent fluctuations mostly cause a “drift” of the representation without fundamentally changing the fidelity and the structure of the representation67,68,69.

Discussion

We have presented a flexible unsupervised approach, Pf-PCA, to analyze tuning variability. This approach provides a general framework for understanding how observed stimulus variables and latent factors together influence neural activity. Specifically, it decomposes the tuning curves into the sum of a mean component and fPCs that are tuned to the stimulus and subject to the modulation of latent factors. We demonstrated that Pf-PCA can robustly and reliably recover the structure of the fluctuations given a few dozen blocks of data. We applied our method to spike train data collected from anesthetized macaque V1 during the viewing of drifting gratings, and discovered several insights regarding the structure of the orientation code.

Our method represents a more flexible modeling framework than previous work for analyzing tuning variability. Previous models often presumed the form of the fluctuation (e.g., refs. 25,28,30,31), and the fluctuation was often assumed to be a constant, acting on the tuning curve through either multiplicative or additive interactions. Thus, the forms of fluctuations captured by these analyses were limited by construction. Our method instead allows unsupervised discovery of arbitrary smooth tuning fluctuations, and potentially multiple forms of fluctuations simultaneously. Our method is broadly applicable, so long as the neurons have smooth tuning over a certain stimulus dimension, which could be spatial frequency70, location17,71, direction72,73, or time74. A potentially fruitful avenue would be to leverage our approach to test computational models, by analyzing data simulated from these models to identify the structure of the latent fluctuations and comparing these predictions with the structure extracted from the data.

Our results suggest that the tuning fluctuations exhibit low-rank structure, both at the level of individual neurons and at the level of neural populations. The latter is generally consistent with, and generalizes, results of previous studies that assumed coherent gain fluctuations among simultaneously recorded neurons26,28,29. A recent study75 found that the variance explained by the PCs of large-scale neural populations scales as a power-law. Our results are different from theirs: (i) our results concern the dimensionality of tuning variability, not the dimensionality of stimulus tuning; (ii) we primarily focus on the amount of variance explained by the top PCs, not the properties of the tail of the spectrum as done in ref. 75. With a few dozen simultaneously recorded neurons, we cannot accurately estimate the scaling relationship between the variance of the tuning variability explained and the number of PCs—an interesting question that could be addressed in the future with larger datasets.

We have focused on an exponential non-linearity for the link function, which has been assumed by many previous models76,77,78,79. It should be possible to further extend it to other types of non-linearity80, such as a power-law transformation32,44,81. It would also be interesting for future research to develop techniques that could automatically infer the type of non-linearity from the data directly.

Our V1 results should inform a better mechanistic and functional understanding of V1. Naively, assuming an exponential non-linearity, the power-law modulation revealed by our analysis could be explained by a tuned input to a given neuron that fluctuates over time. However, this is likely a simplified picture. It would be more fruitful to consider how a threshold non-linearity, together with noise, could lead to this kind of result. Previously, models of this kind81,82,83 have been used to account for the multiplicative gain of the tuning curve induced by varying contrast. Second, the finding that the latent fluctuations are heterogeneous in the population is consistent with the idea that recurrent processing in V1 may play an important role in shaping the structure of the fluctuations of neural tuning. These results echo recent work57 showing that spatially patterned fluctuation structure can emerge in balanced networks in V1, in which neural fluctuations can be heterogeneous. Third, it is interesting to consider the implications of our observations in the context of functional models of neural variability. Such variability has been proposed to reflect sampling of the sensory inputs84, encoding of stimulus uncertainty27, and efficient encoding of natural scene statistics85,86,87. The specific structure of the latent fluctuations extracted by Pf-PCA provides a richer set of summary statistics to further test these mechanistic and functional models and to help develop future models.

Our method enables us to further analyze the coding properties in the presence of tuning fluctuations, both in terms of local properties (via FI) and the global geometrical structure of the code. We found that FI generally does not substantially increase (and sometimes even decreases) with increased neural activity. This may point to the potential importance of cortical inhibition in sharpening the neural code88. The geometry analysis reveals that the manifold induced by the latent fluctuations lies in a different subspace from that induced by changing contrast. This suggests that the tuning fluctuations in V1 may not interfere with the encoding of contrast. These observations deserve further investigation in the future.

Our V1 results are entirely based on analyzing neural responses in the anesthetized state. The extent to which the structure of the fluctuations under anesthesia resembles that of awake-behaving animals remains an open problem. Earlier work using voltage-sensitive dye to measure large-scale activity fluctuations in V1 under anesthesia found that the structure of the spontaneous fluctuations resembled the stimulus-driven activity and that they interacted with stimulus-evoked activity in an additive fashion5,25,89; slow gain fluctuations identified in the anesthetized macaque were also reported to be present in the awake state. In addition, ref. 30 found that the additive and multiplicative changes of the tuning curves were also present in a smaller dataset from one macaque monkey. Nonetheless, anesthesia can change the integration properties of cortical neurons90, and may trigger profound changes in cortical dynamics91 and coding92. Detailed in-depth investigations will be needed to determine whether the structure of V1 tuning variability that we discovered under anesthesia generalizes to the awake state. A further limitation of the anesthetized data is that it precludes relating the latent tuning fluctuations to behavior. It would be interesting to see whether fluctuations of internal states similar to those found here correspond to changes in behavior69.

A few limitations and potential improvements of our results are worth mentioning. Our current method does not explicitly model the temporal structure of the tuning fluctuations, as αt is assumed to be independent across moments and is estimated for each t. It should be possible to improve our method by leveraging a temporal smoothness prior on the scores, e.g., by a weighted Poisson fPCA in which the weights are constructed using a temporal kernel, or by assuming a Gaussian process prior93,94,95,96,97,98—a direction we did not pursue here, but which would be an interesting future direction. Also, when applied to the V1 data, our method only deals with slow fluctuations (~10 s)25 (see Supplementary Fig. 8), because of the assumption that the latent state is the same within every block (or moment). Tuning variability at an even faster timescale would be averaged out; thus our estimate of the tuning fluctuations is likely an underestimate of the true fluctuations. It should be possible to refine these estimates and study tuning variability at an even faster scale. Two approaches may be promising: (i) using faster stimulus sampling in experiments—with a stimulus sampling of 100 ms per stimulus, it should be possible to apply the same approach to study tuning variability at a timescale of ~1 s; (ii) extending our method to fit the neural population jointly. For the latter, assuming a low-dimensional latent structure, it should be possible to infer the latent fluctuation from individual stimulus presentations—a direction we are currently pursuing.

In summary, we have developed a statistical approach to parse the variability of neural tuning. Our approach can flexibly capture the impact of both stimulus variables and latent variables on a moment-by-moment basis. Applying our approach to macaque V1 revealed the structure of the tuning variability at the levels of both individual neurons and the neural population. Our analyses also led to further insights into the FI and the geometry of the code. While we only analyzed the orientation code in V1, we hope that the analysis pipeline developed here will be informative for elucidating the structure of neural tuning and response variability in other neural systems as well99.

Methods

Poisson functional principal component analysis (Pf-PCA): generative model

A standard description of neural responses in neuroscience is based on tuning curves and (typically) Poisson spiking noise. Specifically, the observed spike count n(s) for stimulus s of an individual neuron, during a counting window of length Δt, is modeled with a Poisson distribution

$$p(n(s)\,|\,\mu (s){\Delta }_{t})=\frac{{(\mu (s){\Delta }_{t})}^{n(s)}}{n(s)!}\exp \left(-\mu (s){\Delta }_{t}\right),$$
(4)

where μ(s) represents the tuning curve of stimulus s.

The tuning curve may vary among B moments (i.e., blocks of trials) and is not directly observed. Denoting the tuning curve for moment t as μt(s), t = 1, …, B, we model the logarithm of this stochastic curve, \(\log ({\mu }_{t}(s))\), as follows:

$$\log ({\mu }_{t}(s))=f(s)+\mathop{\sum}\limits_{k}{\alpha }_{k,t}{\phi }_{k}(s)+{\epsilon }_{t}(s).$$
(5)

Here f(s) is the mean component, ϕk(s) is the k-th functional principal component (fPC), and αk,t denotes the amount of fluctuation (i.e., score) for the k-th component during the t-th moment; αk,t is assumed to follow a zero-mean Gaussian distribution with variance \({\sigma }_{k}^{2}\). The terms {ϵt(s)} are independent and identically distributed zero-mean Gaussian noise with variance \({\sigma }_{0}^{2}\), a parameter quantifying the variance of \(\log ({\mu }_{t}(s))\) remaining after projection onto {ϕk(s)}. Note that this is the same model as described in Eq. (1) in the main text, in which the dependence of the individual terms on s was suppressed for simplicity. This model implies that \(\log ({\mu }_{t}(s))\) has mean \({\rm{E}}[\log ({\mu }_{t}(s))]=f(s)\) and covariance function \({\sum }_{k}{\sigma }_{k}^{2}{\phi }_{k}({s}_{1}){\phi }_{k}({s}_{2})+{\sigma }_{0}^{2}\).

Note that with only the first term f(s), this model is equivalent to the standard tuning curve model of spike counts. The second term, ∑kαk,tϕk(s), contributes additional variance from the moment-to-moment fluctuations, making the model naturally capture the over-dispersion of spike counts.

Pf-PCA differs from the multiplicative gain model proposed in ref. 25. First and most importantly, we do not assume a pure multiplicative gain change; instead, the form of the fluctuations is arbitrary (up to smoothness). Second, and more subtly, in Pf-PCA the magnitude of the fluctuation α is assumed to follow a Gaussian distribution, whereas ref. 25 assumed a Gamma distribution for the gain on the firing-rate scale (not the logarithm of the firing rate). Note that αk,t is Gaussian, and thus symmetric, while the logarithm of a Gamma-distributed variable is left-skewed and thus has a different shape.

Inference: a two-step estimation procedure

Assume that we have observations for a set of m stimuli sampled from a stimulus space \({\mathcal{S}}\). The spike count of an individual neuron elicited by stimulus sj, j = 1, ... , m, during the t-th moment (i.e., the t-th block of trials) is denoted as nt(sj). We further denote the spike count vector for the t-th moment as \({\overrightarrow{n}}_{t}={({n}_{t}({s}_{1}),\cdots,{n}_{t}({s}_{m}))}^{\top }\).

Intuitively, if we could recover the unobservable means μt(sj) for t = 1, ... , B, j = 1, ... , m, fitting the model of the stochastic curve \(\log ({\mu }_{t}(s))\) using functional PCA would be straightforward. Denote \({\overrightarrow{\mu }}_{t}={({\mu }_{t}({s}_{1}),\cdots,{\mu }_{t}({s}_{m}))}^{\top }\). We can estimate the posterior of the hidden log firing rate \(\log ({\overrightarrow{\mu }}_{t})\) from the spike count data using an expectation-maximization (EM) algorithm100. Following these ideas, we developed a two-step estimation procedure, as follows.

Step 1: recover the hidden \({\overrightarrow{\mu }}_{t}\)

If the vector \(\log ({\overrightarrow{\mu }}_{t})\) were observable, the log-likelihood of the Poisson model could be written as

$${\ell }_{p}=\mathop{\sum}\limits_{t}\mathop{\sum}\limits_{j}\log \left\{\frac{{[{\mu }_{t}({s}_{j})]}^{{n}_{t}({s}_{j})}}{{n}_{t}({s}_{j})!}\exp \left[-{\mu }_{t}({s}_{j})\right]\right\}.$$
(6)

Eqn. (5) implies that the logarithm of the firing rates for the sampled stimulus set {s1, …, sm} during the t-th moment, \(\log ({\overrightarrow{\mu }}_{t})\), can be modeled as \(\log ({\overrightarrow{\mu }}_{t}) \sim N(\overrightarrow{f},{\boldsymbol{\Sigma }})\), where \({\boldsymbol{\Sigma }}={\sum }_{k}{\sigma }_{k}^{2}{\overrightarrow{\phi }}_{k}{\overrightarrow{\phi }}_{k}^{\top }+{\sigma }_{0}^{2}{\bf{I}}\). Up to an additive constant and a factor of −1/2, the corresponding Gaussian log-likelihood is

$${\ell }_{g}=\mathop{\sum}\limits_{t}{\left(\log ({\overrightarrow{\mu }}_{t})-\overrightarrow{f}\right)}^{\top }{{\boldsymbol{\Sigma }}}^{-1}\left(\log ({\overrightarrow{\mu }}_{t})-\overrightarrow{f}\right)+\log |{\boldsymbol{\Sigma }}|.$$
(7)

In reality the firing rates are not directly observed. However, by treating \(\log ({\overrightarrow{\mu }}_{t})\) as missing data, we can use an EM algorithm that iterates between an E-step and an M-step. Specifically, the E-step calculates the conditional mean \({\rm{E}}[\log ({\overrightarrow{\mu }}_{t})\,|\,{\overrightarrow{n}}_{t}]\) and conditional covariance \({\rm{Cov}}[\log ({\overrightarrow{\mu }}_{t})\,|\,{\overrightarrow{n}}_{t}]\), given the current estimates of the parameters \(\hat{\overrightarrow{f}}\) and \(\hat{{\boldsymbol{\Sigma }}}\) obtained in the M-step. Given the expectations obtained in the E-step, the M-step minimizes ℓg in Eq. (7), which yields the following two updates:

$$\hat{\overrightarrow{f}}=\frac{1}{B}\mathop{\sum }\limits_{t=1}^{B}{\rm{E}}[\log ({\overrightarrow{\mu }}_{t})\,|\,{\overrightarrow{n}}_{t}],\\ \hat{{\boldsymbol{\Sigma }}}=\frac{1}{B}\mathop{\sum }\limits_{t=1}^{B}\left[{\rm{Cov}}[\log ({\overrightarrow{\mu }}_{t})\,|\,{\overrightarrow{n}}_{t}]+\left[{\rm{E}}[\log ({\overrightarrow{\mu }}_{t})\,|\,{\overrightarrow{n}}_{t}]-\hat{\overrightarrow{f}}\right]{\left[{\rm{E}}[\log ({\overrightarrow{\mu }}_{t})\,|\,{\overrightarrow{n}}_{t}]-\hat{\overrightarrow{f}}\right]}^{\top }\right].$$

Note that calculating these expectations in the E-step requires the marginal distribution of \({\overrightarrow{n}}_{t}\), which is not analytically tractable. We thus adopt a Monte Carlo approach. For each t, we generate a set of samples, \(\log {({\overrightarrow{\mu }}_{t})}^{*1},\cdots \,,\log {({\overrightarrow{\mu }}_{t})}^{*M}\), where M = 10,000 is the number of Monte Carlo runs, according to the distribution of \(\log ({\overrightarrow{\mu }}_{t})\) given by the current parameters. Unbiased estimates are then obtained from these samples. Together, Step 1 gives an estimate of the hidden means \({\overrightarrow{\mu }}_{t}\) in the form of \({\rm{E}}[\log ({\overrightarrow{\mu }}_{t})\,|\,{\overrightarrow{n}}_{t}]\).
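One way to realize this Monte Carlo step is self-normalized importance sampling, drawing from the current Gaussian prior and re-weighting by the Poisson likelihood; the following is a sketch of that reading, not necessarily the exact implementation used:

```python
import numpy as np

rng = np.random.default_rng(2)

def e_step(n_t, f_hat, Sigma_hat, M=10_000):
    """Monte Carlo E-step for one moment: estimate E[log mu | n] and
    Cov[log mu | n] by sampling log-rates from the current Gaussian prior
    and re-weighting by the Poisson likelihood (self-normalized
    importance sampling)."""
    m = len(n_t)
    L = np.linalg.cholesky(Sigma_hat + 1e-8 * np.eye(m))
    x = f_hat + rng.normal(size=(M, m)) @ L.T        # samples of log(mu_t)
    log_w = (n_t * x - np.exp(x)).sum(axis=1)        # Poisson log-lik., up
    w = np.exp(log_w - log_w.max())                  # to an x-free constant
    w /= w.sum()
    mean = w @ x                                     # posterior mean
    xc = x - mean
    cov = (w[:, None] * xc).T @ xc                   # posterior covariance
    return mean, cov
```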

Step 2: perform functional PCA on the recovered hidden \({\rm{E}}[\log ({\overrightarrow{\mu }}_{t})\,|\,{\overrightarrow{n}}_{t}]\)

Given the posterior means \({\rm{E}}[\log ({\overrightarrow{\mu }}_{t})\,|\,{\overrightarrow{n}}_{t}]\), we apply functional data analysis to them to obtain the mean component f(s), the fPCs {ϕk(s)}, and their corresponding scores αk,t. Specifically, f(s) is obtained using a natural cubic spline smoothing approach, minimizing

$${\ell }_{s}=\mathop{\sum }\limits_{t=1}^{B}{\left|{\rm{E}}[\log ({\overrightarrow{\mu }}_{t})\,|\,{\overrightarrow{n}}_{t}]-\overrightarrow{f}\right|}^{2}+\lambda \int {\left(\frac{{\partial }^{2}f(s)}{\partial {s}^{2}}\right)}^{2}ds,$$

where λ is chosen via generalized cross validation101.

The functional fluctuations {ϕk(s)} are estimated by penalizing the roughness of the eigenfunctions52,102. The first component \(\overrightarrow{\phi}_1\) and the corresponding score for each moment, \(\hat{\alpha}_{1,t}\), are estimated via

$$\max_{\phi_1(s)} \frac{\mathrm{var}_t\left(\int \phi_1(s)\,\mathrm{E}[\log\mu_t(s)\mid\overrightarrow{n}_t]\,ds\right)}{1+\lambda\int(\phi_1''(s))^2\,ds} \quad \text{subject to} \quad \int(\phi_1(s))^2\,ds = 1,\\ \text{and} \quad \hat{\alpha}_{1,t} = \int\phi_1(s)\left[\mathrm{E}[\log\mu_t(s)\mid\overrightarrow{n}_t] - f(s)\right]ds.$$
(8)

The remaining components and their scores are obtained via an iterative process such that any higher order eigenfunction is orthogonal to the eigenfunctions already recovered. This procedure allows us to estimate the variance explained by each fPC, as quantified by the variance of the score for that component. The proportion of variance explained by the k-th fPC \({\overrightarrow{\phi }}_{k}\) is simply calculated by \({{{{{{{\rm{var}}}}}}}}({\hat{\alpha }}_{k,t})/{\sum }_{{k}^{{\prime} }}{{{{{{{\rm{var}}}}}}}}({\hat{\alpha }}_{{k}^{{\prime} },t})\).
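As a concrete illustration of this step, the sketch below solves a discretized version of Eq. (8) on the stimulus grid, with a second-difference matrix standing in for the spline roughness penalty. It extracts all components at once from a generalized eigenproblem, which is equivalent in spirit to, though not identical with, the iterative procedure described above.

## Discretized roughness-penalized fPCA (illustrative): X is the B x m matrix
## whose rows are E[log(mu_t) | n_t] evaluated on the stimulus grid.
penalized_fpca <- function(X, lambda) {
  m  <- ncol(X)
  Xc <- scale(X, center = TRUE, scale = FALSE)   # remove the mean function
  S  <- crossprod(Xc) / nrow(X)                  # covariance across moments
  D  <- diff(diag(m), differences = 2)           # second-difference operator
  M  <- diag(m) + lambda * crossprod(D)          # 1 + lambda * roughness
  ## Maximizing var(score) / (1 + lambda * roughness) is the generalized
  ## eigenproblem S u = rho M u; whiten with the Cholesky factor of M.
  Rchol <- chol(M)
  A  <- t(solve(Rchol)) %*% S %*% solve(Rchol)
  ev <- eigen((A + t(A)) / 2)
  Phi <- solve(Rchol, ev$vectors)                # back-transform eigenvectors
  Phi <- apply(Phi, 2, function(u) u / sqrt(sum(u^2)))  # unit-norm fPCs
  scores <- Xc %*% Phi                           # alpha_{k,t} for every moment
  v <- apply(scores, 2, var)
  list(phi = Phi, scores = scores, var_explained = v / sum(v))
}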

Implementation of a reduced version of the method: μ-PCA

We also implemented a reduced version of the method, μ-PCA. In this method, after obtaining the posterior mean \(\mathrm{E}(\log(\overrightarrow{\mu}_t)\mid\overrightarrow{n}_t)\), we perform regular PCA directly on the exponential of the estimated posterior mean, \(\exp[\mathrm{E}(\log(\overrightarrow{\mu}_t)\mid\overrightarrow{n}_t)]\), instead of functional PCA.

μ-PCA can be thought of as an alternative way of applying PCA to Poisson count data, compared to Poisson PCA54. First, Poisson PCA54 treated \(\log(\mu_t(s))\) as "natural parameters", which define the mean and the components. In contrast, μ-PCA treats the unobserved \(\log(\mu_t(s))\) as random variables and decomposes them into a mean and principal components, with the fluctuations assumed to be Gaussian. Second, the two methods use different techniques to obtain the principal components. Exploiting the relation between the log-likelihood of the exponential family and the Bregman distance, Poisson PCA54 constructs and optimizes a loss function to obtain the principal components, whereas μ-PCA estimates the posterior mean \(\mathrm{E}(\log(\overrightarrow{\mu}_t)\mid\overrightarrow{n}_t)\) rather than the random variables \(\overrightarrow{\mu}_t\) themselves.
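A minimal sketch of μ-PCA, assuming the posterior means from Step 1 are collected into a B × m matrix post_mean (one row per moment):

## mu-PCA (illustrative): ordinary PCA on the exponentiated posterior means.
mu_pca <- function(post_mean, K = 3) {
  mu_hat <- exp(post_mean)                  # back to firing-rate space
  pc <- prcomp(mu_hat, center = TRUE, scale. = FALSE)
  list(mean = colMeans(mu_hat),
       components = pc$rotation[, 1:K],     # leading principal components
       scores = pc$x[, 1:K])                # per-moment fluctuation scores
}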

Validation of the methods using simulated data

We validated our methods using simulated data with ground truth, inspired by experimental observations30,55,103. To do so, we generated the tuning curve \(\mu(s) = 0.5 + \frac{5}{f_{\mathrm{G}}(0)} f_{\mathrm{G}}(\frac{s}{20})\), where fG is the density function of the standard normal distribution. The stimulus s takes values from −90 to 90 degrees in steps of 22.5 degrees. The number of moments (blocks) B was set to 50 throughout.

For each of the four types of tuning changes, i.e., multiplicative gain, additive modulation, tuning shift, and tuning sharpening, we reverse-engineered the fPC \(\overrightarrow{\phi}\) that would give rise to that type of tuning fluctuation. More concretely, the tuning fluctuations, denoted \(\overrightarrow{\gamma}_t\), are generated from a Gaussian with covariance structure \(\boldsymbol{\Sigma} = \sigma_1^2\overrightarrow{\phi}\,\overrightarrow{\phi}^\top + c\mathbf{I}\), where c is chosen such that the structured component explains 80% of the variance. The second component amounts to white noise accounting for the remaining 20% of the variance. Adding this random noise component allows us to test the robustness of the model in the presence of less structured fluctuations, as well as to evaluate the extent to which our estimation procedure can recover the variance explained by the structured fPC; note that the recovery problem becomes easier without such a noise component. From the simulated data, we first estimated the variance explained by the K fPCs (K = number of stimuli) using Pf-PCA, and then computed the proportion of variance explained by the first fPC as \(\mathrm{var}(\hat{\alpha}_{1,t})/\sum_{k'}\mathrm{var}(\hat{\alpha}_{k',t})\). Effectively, we relied on the last K − 1 components to recover the random component in the generative model.

Here \(\mu_0(s) = \exp(f(s))\) represents the tuning curve with no fluctuation, and \(\sigma_1^2\) denotes the strength of fluctuation. Because the different forms of fluctuation operate on different scales, \(\sigma_1^2\) was set as follows. Multiplicative gain: \(\sigma_1^2 = 1.25\), so that the gain change is \(\log(1.3\mu_0(s)) - \log(0.9\mu_0(s))\). Additive modulation: \(\sigma_1^2 = 5.5\), so that the response change is \(\log[\mu_0(s)+0.4] - \log[\mu_0(s)-0.2]\). Tuning shift: \(\sigma_1^2 = 1.38\), so that the shift is \(\log(\mu_0(s+6)) - \log(\mu_0(s-6))\). Tuning width change: \(\sigma_1^2 = 1.85\), so that the standard deviation changes by ±20%. Under these settings, we generated Poisson spike count data with firing rates \(\mu_t(s) = \exp[\log(\mu_0(s)) + \gamma_t(s)]\).
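As an example, the following sketch generates one of the four conditions (multiplicative gain, for which a constant fPC in log-rate space serves as the reverse-engineered \(\overrightarrow{\phi}\)); the noise level c_noise follows from the 80/20 variance split described above.

library(MASS)

set.seed(1)
s   <- seq(-90, 90, by = 22.5)                    # stimulus grid (degrees)
mu0 <- 0.5 + 5 / dnorm(0) * dnorm(s / 20)         # tuning curve, no fluctuation
B   <- 50                                         # number of moments (blocks)
phi <- rep(1, length(s)); phi <- phi / sqrt(sum(phi^2))  # gain-type fPC (constant in log space)
sigma1_sq <- 1.25
## trace of structured part = sigma1_sq; trace of noise part = c * m;
## 80/20 split implies c = 0.25 * sigma1_sq / m
c_noise <- 0.25 * sigma1_sq / length(s)
Sigma <- sigma1_sq * phi %*% t(phi) + c_noise * diag(length(s))
gam   <- mvrnorm(B, mu = rep(0, length(s)), Sigma = Sigma)  # fluctuations gamma_t
rates <- exp(sweep(gam, 2, log(mu0), `+`))        # mu_t(s) = exp(log(mu0(s)) + gamma_t(s))
counts <- matrix(rpois(length(rates), rates), nrow = B)     # Poisson spike counts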

Note that we also validated our methods (i) when multiple forms of fluctuation co-exist simultaneously, and (ii) when the tuning curves are monotonic104,105. These procedures and their results can be found in Supplementary Notes 2 and 3.

Analyzing data from macaque V1 using Pf-PCA

We used 7 previously published datasets (i.e., 7 sessions). Three of them (D5-D7), publicly available from the CRCNS website, were recorded from anesthetized macaque primary visual cortex by Matthew Smith and Adam Kohn55. In these experiments, described in detail in refs. 55,103, spiking activity was recorded while presenting different grayscale visual stimuli, including drifting sinusoidal gratings (each presented for 1.28 s). The grating directions were [0, 30, 60, …, 330] deg. The other four sessions (D1-D4), previously published in ref. 30 and shared by the authors, also contain visually evoked activity from anesthetized macaque primary visual cortex (see ref. 30 for details). The grating directions were [0, 22.5, 45, …, 157.5] deg. We selected neurons with SNR ≥ 2 and mean firing rate ≥ 1.5 spikes/second. In total, we analyzed 7 datasets comprising 402 neurons.

For each stimulus, we counted the number of spikes in a 500 ms window (80–580 ms after stimulus onset). Because the experiments had a block-randomized design, for each block we obtained a response vector containing the responses to all the stimulus orientations sampled in the experiment. Repeating this for every block, we constructed a spike count matrix for each neuron (number of blocks × number of orientations).
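A sketch of this bookkeeping step; the field names (onset, ori, block) are illustrative placeholders, not the variable names used in the released datasets.

## Build one neuron's blocks x orientations count matrix from spike times
## (spk, in seconds) and a trial table with onset time, orientation, block.
count_matrix <- function(spk, trials, win = c(0.080, 0.580)) {
  oris   <- sort(unique(trials$ori))
  blocks <- sort(unique(trials$block))
  out <- matrix(NA_integer_, length(blocks), length(oris),
                dimnames = list(blocks, oris))
  for (i in seq_len(nrow(trials))) {
    t0 <- trials$onset[i] + win[1]
    t1 <- trials$onset[i] + win[2]
    out[as.character(trials$block[i]), as.character(trials$ori[i])] <-
      sum(spk >= t0 & spk < t1)                  # spikes in the 500 ms window
  }
  out
}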

We then applied Pf-PCA to this matrix for each neuron. In doing so, we obtained the mean component f(s), the fPCs ϕk(s), where k is the component index, as well as the amount of fluctuation, i.e., the scores αk,t, for each moment t. We set the number of fPCs to three, as three fPCs were already sufficient to account for most of the variance (see Supplementary Fig. 6). The results reported in Fig. 3 were obtained based on hundreds of blocks of data (400 for D1-D3, 200 for D5-D7). To examine the impact of sample size, we ran Pf-PCA on subsets of the V1 data comprising 25, 50, or 100 blocks of each dataset. See the results in Supplementary Note 4 and Supplementary Fig. 7.

In Fig. 3a, we plotted the inferred mean component in the form of \(\exp (f(s))\), and the first fPC in the form of \(\exp (f(s)\pm \sigma {\phi }_{1}(s))\), where σ is the s.d. of estimated α1,t.

Regression analysis

We analyzed the relationship between ϕ1(s) and f(s) by performing a regression analysis with the following form

$$\phi_1(s) = b + w\,f(s) + e(s).$$
(9)

The regression was performed using the "lm" function in the statistical computing software R. First, we tested the significance of the regression with an F-test (Fig. 3c). To quantify how much of ϕ1(s) is accounted for by a linear function of f(s), we defined a summary statistic: \(\mathtt{fraction} = 1 - \frac{\sum_s e(s)^2}{\sum_s \phi_1(s)^2}\). This measure is reported in Fig. 3d.
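In R, this amounts to the following, with phi1 and f holding \(\phi_1(s)\) and f(s) evaluated on the stimulus grid:

fit <- lm(phi1 ~ f)                       # phi_1(s) = b + w f(s) + e(s), Eq. (9)
summary(fit)                              # F-test of the regression (Fig. 3c)
fraction <- 1 - sum(residuals(fit)^2) / sum(phi1^2)   # statistic in Fig. 3d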

“Flatness index” analysis

In Fig. 4a, \(\mu_0(s) = \exp(f(s))\) was generated from a von Mises function with parameters satisfying \(\mu_0(s) \in [0.2, 1]\). Denote the tuning curve corresponding to the fluctuation α = α0 by \(\mu_{\alpha_0}(s)\). Define \(\Delta\mu(s) = \mu_\alpha(s) - \mu_0(s) - c(\exp(b\alpha) - 1)\), where c is the baseline of μ0(s). Thus, Δμ(s) captures the change of firing rate as a function of the stimulus, with an additional correction term. The "flatness" index was defined as \(\frac{\Delta\mu(s^{\mathrm{orth}})}{\Delta\mu(s^{\mathrm{pref}})}\), where spref and sorth denote the preferred orientation of the neuron and its orthogonal orientation, respectively.

This index quantifies how flat Δμ(s) is. For an additive change, μα(s) = μ0(s) + caddα, where cadd is a constant, and \(\Delta\mu(s) = c_{\mathrm{add}}\alpha - c(\exp(b\alpha)-1)\), implying that Δμ(s) is completely flat over s; thus flatness = 1 in this case. For multiplicative gain, \(\mu_\alpha(s) = \exp(b\alpha)\mu_0(s)\) and \(\Delta\mu(s) = (\mu_0(s) - c)(\exp(b\alpha)-1)\), implying Δμ(sorth) = 0; thus flatness = 0. Remarks: flatness can exceed 1 when Δμ(sorth) > Δμ(spref), which is possible when Δμ(s) < 0. It is also possible to have flatness < 0, when Δμ(sorth) < 0 < Δμ(spref).
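A direct transcription of the index, assuming the tuning curves are evaluated on a grid and i_pref / i_orth index the preferred and orthogonal orientations:

## Flatness index: mu_alpha and mu0 are tuning curves on the stimulus grid;
## c is the baseline of mu0, and (b, alpha) parameterize the fluctuation.
flatness <- function(mu_alpha, mu0, c, b, alpha, i_pref, i_orth) {
  d_mu <- mu_alpha - mu0 - c * (exp(b * alpha) - 1)   # corrected rate change
  d_mu[i_orth] / d_mu[i_pref]
}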

Connecting the power-law relation to multiplicative gain and additive modulation

The power-law modulation degenerates to multiplicative gain and additive modulation in certain parameter regimes. When w = 0, the power-law modulation is exactly a multiplicative gain. The connection to additive modulation is less obvious. When bα and wα are close to 0, a Taylor expansion gives \(\mu_\alpha(s) \approx \mu_0(s) + \mu_0(s)(b + w\log\mu_0(s))\alpha\). It follows that when the function \(\mu_0(s)b/w + \mu_0(s)\log\mu_0(s)\) is flat over s, the power-law modulation degenerates to an additive change. Examining the function \(g(x) = x\log(x) - \kappa x\), we found that \(g(\mu_0(s))\) is indeed approximately flat as s varies over [0, 180] degrees when κ lies in a certain range, resulting in an approximately additive modulation.
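For concreteness, under our reading that the first fPC is approximately linear in f(s), i.e., ϕ1(s) = b + wf(s) (cf. Eq. (9)), the modulated tuning curve and its first-order expansion are

$$\log\mu_\alpha(s) = f(s) + \alpha\left(b + wf(s)\right) \;\Longrightarrow\; \mu_\alpha(s) = e^{b\alpha}\,\mu_0(s)^{1+w\alpha} \approx \mu_0(s) + \mu_0(s)\left(b + w\log\mu_0(s)\right)\alpha,$$

where the approximation keeps the first-order terms in bα and wα; setting w = 0 recovers the pure gain \(e^{b\alpha}\mu_0(s)\).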

Control analysis

Simulated data from multiplicative gain model

To generate synthetic data from the multiplicative gain model, we set the log tuning curve at each moment to be the log mean firing rate plus a constant (stimulus-independent) fluctuation, with the standard deviation of the fluctuation matched to that inferred from the real data. We then sampled spike counts under Poisson noise. In doing so, we generated synthetic data that approximately match the amount of fluctuation in the real data, but with a pure multiplicative gain. We performed the regression analysis on the simulated data using the same procedure as for the real data (described above), and compared the slope values obtained for the real and synthetic data. The results are reported in Fig. 3e.
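A minimal sketch of this control, assuming mu_bar holds the mean tuning curve and sd_alpha the fluctuation s.d. matched to the data:

## Pure-gain control data: one shared, stimulus-independent log-rate offset
## per moment, then Poisson noise.
gain_control <- function(mu_bar, B, sd_alpha) {
  alpha <- rnorm(B, 0, sd_alpha)                  # one gain fluctuation per moment
  rates <- exp(outer(alpha, log(mu_bar), `+`))    # B x m matrix exp(log(mu) + alpha)
  matrix(rpois(length(rates), rates), nrow = B)   # Poisson spike counts
}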

Rank-1 model

In Fig. 5c, d, f and Supplementary Fig. 11, we reported the recovered score matrix based on a rank-1 multiplicative gain model, together with its dimensionality. When simulating this model to generate synthetic data, the multiplicative gain fluctuations were sampled i.i.d. from a normal distribution. To ensure comparability of results, we set the variance such that the fluctuations of each simulated neuron match those of the real neural data. The fluctuation at each moment was shared by all neurons in the population, ensuring that the score matrix is rank-1. We performed the Pf-PCA analysis on the simulated population and obtained the recovered score matrix. We also simulated and analyzed synthetic data from a rank-1 additive-modulation model; these results are reported in Supplementary Fig. 12.

Fisher information

Assume that the tuning curve of neuron i is μ(i)(s) and that the spike count n(i)(s) follows a Poisson distribution with mean μ(i)(s). Because we can approximate μ(i)(s) by the mean f(s) plus the functional fluctuations, the Fisher information (FI) of neuron i at stimulus s, given the scores αk, where k indexes the components, is obtained by

$$I^{(i)}(s) = \frac{\partial\mu^{(i)}(s)}{\partial s}\,\mathrm{E}\left[-\frac{\partial^2[n^{(i)}\log(\mu^{(i)}) - \mu^{(i)}]}{(\partial\mu^{(i)})^2}\right]\frac{\partial\mu^{(i)}(s)}{\partial s}\\ = \left[\mu^{(i)}(s)\left(\frac{\partial f^{(i)}(s)}{\partial s} + \sum_k\alpha_k\frac{\partial\phi_k^{(i)}(s)}{\partial s}\right)\right]\frac{1}{\mu^{(i)}(s)}\left[\mu^{(i)}(s)\left(\frac{\partial f^{(i)}(s)}{\partial s} + \sum_k\alpha_k\frac{\partial\phi_k^{(i)}(s)}{\partial s}\right)\right]\\ = \mu^{(i)}(s)\left[\frac{\partial f^{(i)}(s)}{\partial s} + \sum_k\alpha_k\frac{\partial\phi_k^{(i)}(s)}{\partial s}\right]^2.$$
(10)

To compute the population Fisher information, we assumed that the neurons are independent conditioned on the fluctuations, and summed the FI over the neurons in the population for each stimulus. Note that the reported FI for the neural population or for individual neurons (Supplementary Fig. 14) is the total FI (Fig. 6), obtained by summing over the different stimulus orientations within an individual experimental block.
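A sketch of Eq. (10) on the discrete orientation grid, with derivatives approximated by finite differences (the function and argument names are ours):

## Single-neuron FI per stimulus: f is f(s), Phi an m x K matrix of fPCs,
## alpha a length-K score vector, s the orientation grid (same units as s).
fisher_info <- function(f, Phi, alpha, s) {
  log_mu <- as.numeric(f + Phi %*% alpha)   # log(mu(s)) = f(s) + sum_k alpha_k phi_k(s)
  mu <- exp(log_mu)
  dlog <- c(NA, diff(log_mu) / diff(s))     # finite-difference d log(mu) / ds
  mu * dlog^2                               # Eq. (10): mu(s) [f' + sum_k alpha_k phi_k']^2
}
## Population FI at each s: sum fisher_info over neurons (independence
## conditioned on the fluctuations); total FI: additionally sum over s.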

Recovery analysis on FI

To test whether our method has the statistical accuracy to recover the relation between FI and spiking activity, we performed a control recovery analysis. We first generated synthetic datasets by simulating from the Pf-PCA model with the parameter values estimated from the real data. Specifically, for each neuron we set the Poisson mean μt(s) of moment t via \(\log(\mu_t(s)) = f(s) + \alpha_{1,t}\phi_1(s)\), and generated the counts of moment t from a Poisson distribution with that mean. From this, we obtained the synthetic population counts. We then applied the same analysis pipeline to these synthetic data to estimate the population FI. This control analysis showed that our method can accurately recover the relationship between FI and total spiking activity.

FI and classification analysis

We performed a classification analysis similar to ref. 30 to examine the relation between the population FI and classification accuracy. Similar to ref. 30, we split the data into two groups (i.e., high and low) sorted by the population activity. We then performed classification based on neuronal ensembles of different sizes. Given a randomly selected ensemble of a given size, we fit a multinomial logistic regression and measured performance as the proportion of correct classifications. To avoid overfitting, we used 5-fold cross-validation and report the average performance across the five sets of left-out data. For each ensemble size, we repeated this analysis on 500 randomly selected ensembles for the high and low groups each. These results are reported in Supplementary Fig. 15a. We also performed this analysis on the synthetic data described above; the results are shown in Supplementary Fig. 15b.
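A sketch of the decoding step, assuming a trials × neurons count matrix X and a factor of orientation labels y; we use nnet::multinom for the multinomial logistic regression, which may differ from the original implementation:

library(nnet)

## Cross-validated decoding accuracy for one random ensemble.
decode_cv <- function(X, y, ens_size, folds = 5) {
  idx  <- sample(ncol(X), ens_size)               # random neuronal ensemble
  fold <- sample(rep(1:folds, length.out = nrow(X)))
  acc <- sapply(1:folds, function(k) {
    train <- data.frame(y = y[fold != k], X[fold != k, idx, drop = FALSE])
    fit  <- multinom(y ~ ., data = train, trace = FALSE)
    pred <- predict(fit, newdata = data.frame(X[fold == k, idx, drop = FALSE]))
    mean(pred == y[fold == k])                    # proportion correct
  })
  mean(acc)                                       # average over held-out folds
}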

Analysis of representational geometry

We analyzed the geometry of the representation under a simple multiplicative gain model and the power-law model inferred from the V1 data.

For the simple multiplicative gain model, recapitulating the effect of changing contrast, we generated a homogeneous set of tuning curves using a von Mises function (tuning width parameter equal to 1). We assumed that the multiplicative gain modulated the firing rates of all neurons in the same way. In Fig. 7a–e, we assumed that the multiplicative gain could take four different levels (0.25, 0.5, 0.75, 1), and computed the representational distance matrix by evaluating the representational distance for each pair of states (a state being defined by both stimulus orientation and multiplicative gain). We performed three-dimensional classical MDS to visualize the geometrical structure of the representation, and obtained the projection onto the first two dimensions. A similar analysis was performed for a pure additive change (for results, see Supplementary Fig. 16).
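The following sketch reproduces this construction with illustrative parameter choices (32 neurons with evenly spaced preferred orientations; orientation mapped to a full cycle since it has a 180-degree period):

## Gain-model geometry: homogeneous von Mises tuning (kappa = 1), four gain
## levels, pairwise Euclidean distances, classical MDS.
theta <- seq(0, 175, by = 5) * pi / 90            # orientations -> full cycle
prefs <- seq(0, 2 * pi, length.out = 33)[-33]     # evenly spaced preferences
tc <- sapply(prefs, function(p) exp(cos(theta - p)))    # stimuli x neurons rates
gains <- c(0.25, 0.5, 0.75, 1)
resp <- do.call(rbind, lapply(gains, function(g) g * tc))  # all (orientation, gain) states
emb <- cmdscale(dist(resp), k = 3)                # 3-D classical MDS
plot(emb[, 1], emb[, 2])                          # first two dimensions (cf. Fig. 7)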

For the models based on Pf-PCA inferred from real data (Fig. 7f–l), we performed the geometry analysis by the following steps:

(i) We first generated the mean firing rates for each moment t from our power-law modulation model. We clustered the blocks × neurons score matrix into 10 clusters (grouping blocks) using k-means, and then computed the average score within each cluster to obtain the "10-state averaged scores", a 10 × neurons matrix. For each of the 10 states, the corresponding population tuning curves were generated.

(ii) To reduce the effect of biased sampling of neurons, we created a more shift-invariant neural population code by shifting the tuning curves 8 times, by 20 degrees each. Our assumption here is that the neural code for orientation in V1 is roughly shift-invariant.

(iii) We calculated the Euclidean distances between stimuli based on the extended population matrix (after a variance-stabilizing square-root transformation for Poisson noise) to obtain a distance matrix, and performed classical MDS on this matrix. In most sessions we used 3-D MDS. In two of the seven sessions, the latent fluctuations were smaller, so the cylinder structure did not appear in 3-D MDS; for these two sessions we used 5-D MDS, in which the cylinder-like structure becomes apparent when plotting the first two dimensions together with the fifth. A sketch of these steps is given after this list.
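A compact, self-contained sketch of steps (i)–(iii) with toy numbers (400 blocks, 30 neurons, a toy power-law tuning model); only the pipeline structure is meant to match the analysis:

set.seed(1)
## Toy stand-in for the Pf-PCA score matrix (blocks x neurons).
score_matrix <- matrix(rnorm(400 * 30, sd = 0.5), 400, 30)
states <- kmeans(score_matrix, centers = 10)$centers     # (i) 10-state averaged scores
s <- seq(0, 175, by = 5)                                 # orientation grid (degrees)
resp_state <- function(alpha, prefs) {                   # power-law-style rates per state
  sapply(seq_along(prefs), function(i) {
    f <- cos((s - prefs[i]) * pi / 90)                   # toy log mean tuning f(s)
    exp(f + alpha[i] * (0.3 + 0.2 * f))                  # phi(s) = b + w f(s), toy b, w
  })
}
prefs0 <- seq(0, 174, length.out = 30)
pop <- NULL
for (shift in seq(0, 140, by = 20))                      # (ii) 8 shifted copies, 20 deg apart
  pop <- cbind(pop, do.call(rbind,
    lapply(1:10, function(j) resp_state(states[j, ], prefs0 + shift))))
D <- dist(sqrt(pop))               # (iii) Euclidean distance after sqrt transform
emb <- cmdscale(D, k = 3)          # classical MDS (5-D for two low-fluctuation sessions)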

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.