Sensory integration dynamics in a hierarchical network explains choice probabilities in cortical area MT

Wimmer, Klaus; Compte, Albert; Roxin, Alex; Peixoto, Diogo; Renart, Alfonso; de la Rocha, Jaime

doi:10.1038/ncomms7177

Download PDF

Article
Open access
Published: 04 February 2015

Sensory integration dynamics in a hierarchical network explains choice probabilities in cortical area MT

Klaus Wimmer¹,
Albert Compte¹,
Alex Roxin^1,2,
Diogo Peixoto^3,4,
Alfonso Renart³ &
…
Jaime de la Rocha¹

Nature Communications volume 6, Article number: 6177 (2015) Cite this article

12k Accesses
98 Citations
25 Altmetric
Metrics details

Subjects

Abstract

Neuronal variability in sensory cortex predicts perceptual decisions. This relationship, termed choice probability (CP), can arise from sensory variability biasing behaviour and from top-down signals reflecting behaviour. To investigate the interaction of these mechanisms during the decision-making process, we use a hierarchical network model composed of reciprocally connected sensory and integration circuits. Consistent with monkey behaviour in a fixed-duration motion discrimination task, the model integrates sensory evidence transiently, giving rise to a decaying bottom-up CP component. However, the dynamics of the hierarchical loop recruits a concurrently rising top-down component, resulting in sustained CP. We compute the CP time-course of neurons in the medial temporal area (MT) and find an early transient component and a separate late contribution reflecting decision build-up. The stability of individual CPs and the dynamics of noise correlations further support this decomposition. Our model provides a unified understanding of the circuit dynamics linking neural and behavioural variability.

Targeted V1 comodulation supports task-adaptive sensory decisions

Article Open access 30 November 2023

Large-scale dynamics of perceptual decision information across human cortex

Article Open access 09 October 2020

Decoding and perturbing decision states in real time

Article 20 January 2021

Introduction

The neural basis of decision making has been studied extensively in monkeys performing perceptual discrimination tasks¹. One influential finding is that the response variability of single neurons in visual areas such as MT is predictive of the monkey’s choice. A common measure of this correlation is ‘choice probability’ (CP)², the probability that an ideal observer can predict the monkey’s choice solely based on the number of spikes fired by a neuron. CPs above chance level have been found consistently across the visual system^3,4, in a variety of discrimination tasks^{2,5,6,7,8,9,10}.

Two different interpretations of CP in sensory neurons have emerged: in the bottom-up interpretation, variability in the choice is partly caused by variability in the response of sensory neurons, and CP quantifies this causal relationship². This interpretation can be formalized in a feedforward network model¹¹, where (1) the choice is determined by comparing the pooled activity of noisy sensory neurons across two populations with opposite stimulus preferences, and (2) neuronal variability within these populations is positively correlated^11,12. These noise correlations have generally been observed experimentally^10,13,14, but their magnitude and spatio-temporal structure seem to vary across areas, species and experimental conditions. In the top-down interpretation^9,15,16,17, the variability of sensory neurons that correlates with choice arises due to trial-to-trial fluctuations in top-down signals, which modulate the magnitude of the evoked responses^18,19,20. The nature of these top-down signals remains, however, largely unknown: it is not clear on what time-scale they operate¹⁶, what causes their variability, and whether they are generated before the stimulus presentation, reflecting some kind of bias or expectation, or they are instead recruited by sensory inputs as some kind of bottom-up attentional signal. In any case, CP due to top-down inputs reflects computations that escape the control of the experimenter and cause trial-to-trial response variability that is not necessarily noise.

To differentiate between bottom-up and top-down mechanisms, a recent study compared the dynamics of sensory evidence integration and the time-course of CP in a disparity discrimination task⁹. They found that the impact of stimulus fluctuations on the decision decreases over time, whereas CP increases and reaches a plateau. This indicates that CP cannot be exclusively due to the causal effects of sensory activity on the decision and supports a non-causal relation through top-down signals. However, top-down connections from associative to sensory areas could give rise to recurrent loops across the cortical hierarchy, questioning the rationale of establishing the direction of causality. Whether this recurrent interaction exists and how it may impact the dynamics of sensory integration remains to be elucidated. A further challenge for interpreting CP is that it is directly linked to the structure of noise correlations¹², but the sources of correlations are not well understood. On one hand, it has become clear that correlations are not a fixed hard-wired property of sensory circuits but depend on a number of factors including the context of the task¹⁴ and attentional states^18,21,22. On the other hand, theoretical work has shown that shared inputs do not necessarily cause correlations in recurrently connected networks²³, so that we currently lack a canonical network model that can generate a structure of noise correlations as measured experimentally. The emerging view is that correlations do not have a unique origin but can be caused, in addition to hard-wired connectivity, by feedforward (for example, eye movements²⁴ or stimulus fluctuations²⁵), intrinsic (for example, stochastic global fluctuations of ongoing activity²⁶) and top-down sources^14,20, making CPs hard to interpret^3,4.

Here, we present a hierarchical network model of spiking neurons, representing a sensory and an associative cortical area and carrying out the discrimination of two stimulus categories. Noise correlations between sensory neurons together with topographical top-down connections give rise to CP that is generally composed of two contributions: a bottom-up component, which peaks after stimulus onset and decreases as the decision is being formed, and a top-down component, which simultaneously increases until reaching a steady level. We analyse single unit and paired unit recordings from a classic motion discrimination experiment^2,13 and show that CPs in MT have an early bottom-up component (<500 ms after stimulus onset) revealed by trial-to-trial stimulus fluctuations. Leveraging on the heterogeneity exhibited by individual CP time-courses, we further show the contribution of a late component, consistent with slowly varying top-down signals that represent the upcoming choice. This slow late component is also revealed by the rising time-course of lagged spike count noise correlations. Thus, our model elucidates how the emergent dynamics, developing across the hierarchy during sensory integration, frames the relationship between neuronal and behavioural variability in perceptual decision tasks.

Results

Mechanisms underlying correlations and CP

We developed a computational model of perceptual decision making that allowed us to isolate and quantify the dynamics and the relative contributions of bottom-up and top-down mechanisms to spike count noise correlations and CP of sensory neurons. We simulated a standard two-alternative forced-choice motion discrimination task using fixed-duration random dot kinematograms (RDKs)^1,2,3,13. The spiking network consists of an integration circuit (for example, LIP, FEF) and a sensory circuit (MT), recurrently coupled via bottom-up feedforward connections and top-down feedback connections (Fig. 1a). The integration circuit accumulates sensory evidence and produces a binary categorization due to winner-take-all competition between two decision-encoding populations²⁷. The sensory circuit contains neural populations selective to opposite directions of motion, the average responses of which vary approximately linearly with the stimulus coherence. We primarily studied responses to ambiguous, zero-coherence stimuli, which maximize behavioural variability. We used stimuli that caused strong temporal modulation of sensory population rates (Fig. 1b) comparable to those produced by the temporal variations in motion energy caused by RDKs in MT neurons^28,29 (see Methods). These time-varying rates are integrated by populations D1 and D2 in the integration circuit until the network reaches the attractor state associated with one of the choices²⁷ (Figs 1b and 2a). Because we are interested in the relationship between neuronal and choice variability, we set the sensory circuit to operate in the balanced regime^30,31 where neurons exhibit large response variability³². We wanted to characterize the structure of noise correlations, as it provides the link between single neuron variability and behaviour^11,12. We classified correlations among sensory neurons as bottom-up or top-down, depending on whether they were generated in the absence or by virtue of top-down feedback connections, respectively.

**Figure 1: Model architecture and single-trial response.**

**Figure 2: Impact of bottom-up and top-down correlations on choice probability.**

To study the dynamics of CP caused by bottom-up correlations only, top-down connections in the network were removed. We first noticed that because the sensory circuit operated in the balanced regime, average correlations were marginally small despite the presence of anatomically shared inputs²³. As a consequence, the network did not give rise to substantial CP. We therefore investigated two other potential sources of bottom-up noise correlations in the model: (1) trial-to-trial stimulus fluctuations and (2) the spatial arrangement of external background inputs to the sensory populations. To generate trial-to-trial stimulus fluctuations, we used different realizations of the time-varying zero-coherent stimulus in each trial. This condition, generally used in experiments, was termed non-replicate to distinguish it from repeated presentations of identical replicate stimuli. As a consequence of the non-replicate condition, neuronal pairs within the same population (E1E1 and E2E2) showed positive trial-to-trial correlations (Fig. 2c) reflecting the co-modulation of the evoked rates around the mean response obtained over different zero-coherent stimuli (Supplementary Fig. 1). Thus, these correlations should technically be termed signal correlations instead of noise correlations. Across-population correlations (E1E2) were negative due to competition between E1 and E2 mediated by common inhibition (Supplementary Fig. 2 and Methods). Thus, the difference between correlations within the same population and correlations across populations was large and constant throughout the stimulus period (Fig. 2c), a necessary condition to yield sustained CP¹². Despite this, CP showed a fast rise followed by a slow decay towards chance level (Fig. 2b). This CP time-course is a direct consequence of the non-linear dynamics of the integration circuit, which, as the trial progresses, approaches an attractor causing a decrease of the impact of the sensory activity fluctuations on the upcoming decision^27,33 (Fig. 2a inset). We call this effect transient evidence integration.

The same qualitative structure of bottom-up correlations was generated by, instead of using non-replicate stimuli, modifying the spatial arrangement of the external background inputs. In circuits with strong global inhibition, spatially localized external inputs are more effective in generating large amplitude fluctuations in population rates than non-specific global background inputs³⁴ (Supplementary Fig. 2). Thus, local background inputs specifically targeting E1 and E2 were able to generate correlations with the same magnitude and structure as stimulus fluctuations, leading to virtually identical CP time-courses (Fig. 2b). In addition, the correlations produced by local background inputs caused a substantial decrease in the accuracy of the categorization^11,35 (Supplementary Fig. 3d), which was already suboptimal in the absence of correlations due to transient evidence integration²⁷ (Supplementary Fig. 3e).

Given the generality of transient integration, the same qualitative CP time-course is obtained across a broad range of correlation amplitudes (Supplementary Fig. 4), if correlations are caused by other bottom-up sources (for example, inherited from an upstream area), or if the integration circuit is replaced with a bounded integrator²⁹ (Supplementary Fig. 5b). Thus, given the invariance of the CP dynamics to the exact origin of bottom-up correlations, we used stimulus fluctuations to develop our experimental predictions as they are easy to manipulate.

Next, we investigated the impact of top-down signals by including weak feedback connections from the neurons in the integration circuit to the corresponding sensory populations (D1 to E1, D2 to E2; Fig. 1a). To isolate the effect of top-down feedback, we removed bottom-up correlations by using global background inputs and by presenting replicate stimuli. As evidence supporting one choice built up in the integration circuit, top-down inputs projected this gradual increase in activity and produced a small boost in the rate of the corresponding sensory neurons, particularly towards the end of the stimulus period (Fig. 2d). This choice-dependent increase in rate generated both correlations between sensory neurons and a CP with a ramp-and-plateau time-course mirroring the build-up of the decision (Fig. 2e,f). The amplitude of the CP increased with the strength of top-down connections (Supplementary Fig. 4e–g).

When taken in isolation, neither bottom-up correlations nor top-down connections could account for the fast rise in CP followed by a plateau that is observed experimentally². However, when taken together, the two mechanisms could reproduce this time-course (Fig. 3a). The resulting CP was approximately the sum of the CPs obtained due to either bottom-up or top-down contributions alone, regardless of whether bottom-up correlations were caused by one or several factors (Fig. 3d). The sustained CP time-course was not a general feature of the model but depended on the relative strength of these contributions (Supplementary Figs 3b and 4a,e). However, the decaying bottom-up and the rising top-down component were both governed by the dynamics of sensory evidence integration and thus evolved with the same time-scale. This made CP roughly invariant to changes in decision dynamics (Fig. 3a,c). The network’s psychophysical kernel, obtained from the difference between the average stimuli yielding each choice^9,29 (Fig. 3b), revealed that, despite the sustained CP, the network performed transient integration of sensory evidence. Finally, the decomposition of CP into bottom-up and top-down components was also qualitatively unchanged if the average correlation across all pairs, which was close to zero in our model (Fig. 2c,f), was increased while maintaining the difference corr(EiEi)—corr(EiEj) (Supplementary Fig. 6).

**Figure 3: Bottom-up correlations together with top-down signals can lead to sustained CP and a decaying psychophysical kernel.**

CP in MT during a motion discrimination task

We then asked whether CP in MT neurons during a fixed-duration motion discrimination task could also be decomposed, like in the model, into (1) an early, bottom-up component, reflecting in part the impact of stimulus fluctuations and (2) a late, top-down component that reflects feedback from an integration circuit. We reanalysed responses to zero-coherence RDKs from classical monkey experiments^2,13, which yield a sustained time-course of the population-averaged CP² (Supplementary Fig. 7a). However, sustained CP could also be explained by perfect evidence integration throughout the whole stimulus period (that is, without a bound) in the absence of top-down signals (Supplementary Fig. 5e). This is unlikely because perfect integration weighs the evidence uniformly, yielding a sustained psychophysical kernel (Supplementary Fig. 5f), in contrast to the decaying psychophysical kernels obtained in similar tasks^9,29 and in two monkeys that we trained to perform the fixed-duration motion discrimination task^2,13 (Supplementary Fig. 7c,e; Methods).

We first searched for evidence in the MT data of an early bottom-up component of CP that could be revealed by manipulating the magnitude of bottom-up sensory correlations. To do this, we compared responses in the non-replicate and the replicate conditions, as a way to assess the impact of bottom-up correlations, caused by trial-to-trial stimulus fluctuations, on CP^{2,10,25,28,36}. We first confirmed that stimulus fluctuations had a sustained impact on neuronal variability, as we found substantially lower spike count Fano factors and correlations in MT for the duration of the stimulus in the replicate condition²⁵ (Fig. 4a,b). We then tested the prediction of our model, that replicate stimuli should produce CP of smaller magnitude, particularly early during the stimulus presentation, while the network is integrating the sensory evidence, and not late, when CP is mainly reflecting the impact of top-down inputs (Fig. 3a,d). Indeed, we found that CP in the replicate condition was significantly lower than for the non-replicate condition early (epoch 0–1,000 ms), but not late (epoch 1,000–2,000 ms), in the trial (Fig. 4c). Significance was assessed using a mixed-effects ANOVA (with factors epoch (early/late), stimulus (repl./non-repl.), and random factors monkey and neuron identity) for 250 ms, showing a significant interaction effect of epoch × stimulus, F_(1,317)=4.75, P=0.03, n=41 (118) neurons for repl. (non-repl.). One-tailed t-tests revealed a significant difference of repl. versus non-repl. for early (CP(repl.)=0.512±0.008, CP(non-repl.)=0.532±0.005, P=0.02, t(157)=−2.07) but not late (CP(repl.)=0.533±0.011, CP(non-repl.)=0.529±0.006, P=0.72, t(157)=0.353). Thus, a substantial part of CP early in the trial can be attributed to bottom-up correlations caused by stimulus fluctuations. The higher impact of bottom-up correlations early in the trial is a signature of transient integration of sensory evidence and rules out the possibility that the monkeys performed perfect integration (Supplementary Fig. 5e,f). The similar CP late in the trial is consistent with the proposed top-down component, which is present independently of the stimulus condition (replicate versus non-replicate).

**Figure 4: Stimulus fluctuations impact neuronal variability and reveal a bottom-up component of choice probability in MT.**

Stability of individual CPs increases through the trial

We next searched for evidence of a separate contribution to CP caused by late top-down inputs by investigating the heterogeneity of CP profiles across MT neurons. We reasoned that if CP is generated by two separate mechanisms, individual neurons might not participate equally in the two, particularly if they come from layers with a different density of bottom-up versus top-down inputs. So far, we used a homogeneous network, in which all sensory neurons received a stimulus input with the same strength, and top-down connections with equal probability. In the homogeneous network, when the average CP was sustained, individual CP time-courses caused by non-replicate stimuli were also sustained (Fig. 5a), only showing heterogeneity in their amplitude due to heterogeneous firing rates. This stability of CPs over time can be captured by the rank correlation across neurons of the CP measured at two different time bins (CP correlation C(t_i,t_j); Fig. 5a inset), which remains high for all time bin pairs (Fig. 5b). In particular, the CP correlation between adjacent time bins C(t_i,t_i+1) was constant through the trial (Fig. 5b, right). Introducing heterogeneity in the efficacies of stimulus and top-down inputs in the circuit (Methods) can yield sustained average CP and heterogeneous individual CP time-courses (Fig. 5c and Supplementary Fig. 9). Individual profiles showed combinations of fast-rise-and-decay and slow-ramp-and-plateau behaviour, depending on whether they received stimulus and/or top-down inputs (Fig. 5c). Consequently, CPs between adjacent time bins were less correlated at the beginning of the trial (Fig. 5d) because the impact of trial-to-trial stimulus fluctuations made the individual CPs change rapidly (Fig. 3a). Correlations between adjacent bins increased towards the second half of the trial as individual CPs became more stable (Fig. 5d). This analysis is robust to small trial number (Supplementary Fig. 9).

**Figure 5: Heterogeneity of individual choice probabilities reveals increased stability towards the end of the stimulus interval.**

Individual CP time-courses of MT neurons in response to non-replicate stimuli also showed large heterogeneity (Fig. 5e) in spite of yielding a sustained average CP (Supplementary Fig. 7a). As in the heterogeneous network, CP correlations between adjacent time bins were low at the beginning and increased significantly through the trial (Fig. 5f; linear regression intercept=–0.05 and slope=0.25 s^–1 significantly different from zero, P<0.001, permutation test). Moreover, the average CP correlation across time bin pairs within the first half of the trial was significantly smaller than within the second half (mean of C(t_i,t_j)=0.096±0.026 for t_i, t_j<1 s, i≠j, and 0.275±0.025 for t_i, t_j>1 s, i≠j, respectively; P<0.001, n=6 time bin pairs, permutation test). Thus, individual CP time-courses in MT were heterogeneous and showed increasing stability towards the end of the stimulus, consistent with the slow build-up of a top-down contribution.

Lagged correlations predict late CP

Finally, we looked for further evidence of a late and slow top-down contribution to CP by analysing spike count correlations in MT simultaneous pair recordings^13,25. In the model, pairwise correlations produced by stimulus fluctuations were short-lived (~50–100 ms; Fig. 6a), mimicking fluctuations in motion energy at the speed of the coherent motion produced by RDKs (Methods). In contrast, correlations caused by top-down inputs were weaker, but as they built up (Fig. 2f), they extended to time lags of a few hundred milliseconds (Fig. 6b). This is because these correlations are caused by trial-to-trial variations in the population rates of the integration circuit that remain relatively stable during the late part of the stimulus (Fig. 1b). When both sources are combined (Fig. 6c), the model predicts large and sustained correlations at zero lag (Fig. 6d) and weak lagged correlations with a rising time-course (Fig. 6e). We tested this prediction in MT pairs and found that both the instantaneous and lagged correlations averaged over pairs resembled those predicted by the model (Fig. 6f,g). Moreover, individual pairs with rising lagged correlations tended to exhibit large values of late CP (Fig. 6h). This is consistent with our heterogeneous model, where rising lagged correlations and increasing CP time-courses were specific of neural pairs receiving top-down inputs (Supplementary Fig. 10). We thus divided MT pairs in two groups based on the sign of the slope of lagged correlations (Fig. 6h). As predicted by the model, neurons showing rising lagged correlations yielded a larger late CP than neurons showing a decay (Fig. 6i,j). The difference in CP specifically occurred late in the trial (Fig. 6i,j; mixed-effects ANOVA with factors epoch (early/late), slope (positive/negative), and random factor neuron identity, revealed a significant interaction epoch × slope, F_(1,129)=4.14, P=0.046, n=22 (43) for positive (negative) slope, and a significant difference of negative versus positive for late but not early, P=0.007, t(63)=2.55 and P=0.33, t(63)=0.443, one-tailed t-tests). Neurons with rising lagged correlations also showed higher CP at stimulus onset. We speculate that this difference could be due to pre-stimulus expectation signals⁹, not implemented in the model, that modulate sensory activity using the same top-down pathway.

**Figure 6: MT neuron pairs with rising lagged correlations exhibit enhanced late CP.**

Top-down feedback enhances the stability of the categorization

Functionally, top-down connections generated a positive feedback loop across the hierarchy that modified the decision dynamics and enhanced the stability of the categorization (Fig. 7). We arbitrarily defined a non-absorbing bound in the firing rate that, when reached by population D1 or D2, indicates strong evidence in favour of motion in the corresponding direction. After the first bound crossing, the attractor dynamics of the integration circuit tended to maintain this state until the end of the stimulus when the decision had to be taken²⁷. This occurred unless a large fluctuation in sensory activity reversed the competition between D1 and D2 (ref. 27) (Fig. 7a). Adding top-down connections altered the dynamics by preventing some reversals observed in the network without top-down (Fig. 7b). This occurs because, when approaching the bound, top-down feedback generates a difference between the rates of the two sensory populations (Fig. 2d) that enhances the stability of the current state of the integration circuit (Fig. 7b and Fig. 2d inset). Thus, stronger top-down feedback yielded fewer reversals (Fig. 7d). Raising the bound decreased the fraction of reversal trials but increased the fraction of trials without threshold crossing (that is, weak-confidence trials³⁷). Stronger top-down feedback increased the difference between D1 and D2 rates at stimulus offset, resulting in fewer weak-confidence trials (Fig. 7d,e). Consistent with fewer reversals (Fig. 7d), stronger top-down feedback yielded shorter integration windows and a weaker impact of late stimulus fluctuations on the decision (Fig. 7f). Enhanced stability came at the cost of decreased discrimination accuracy across a broad range of stimulus coherences²⁷ revealing a trade-off between stability and accuracy (Fig. 7g). This trade-off can be portrayed as the standard speed-accuracy trade-off: shorter integration windows, resulting from stronger top-down inputs, yielded higher discrimination thresholds (Fig. 7h). Comparing the network’s performance with an optimal classifier shows that the decrease in accuracy due to the increase in top-down strength can be entirely attributed to the concurrent shortening of the integration window (Fig. 7h). Thus, the strength of top-down feed-back could be optimized to maximize reward rate depending on task details such as stimulus duration and the cost of erroneous and undecided trials.

**Figure 7: Top-down inputs strengthen the categorization dynamics and increase their stability.**

Discussion

We have developed a hierarchical network model to investigate how CP, the correlation between neuron response variability and perceptual decisions, emerges from recurrent cortical network dynamics. The model dissects CP into an early contribution, dominated by the impact of bottom-up trial-to-trial neuronal co-fluctuations on the decision, and a late contribution produced by top-down inputs into sensory neurons, which reflects the decision build-up in each trial. The time-courses of the two contributions are determined by the non-linear integration of sensory activity resulting from the dynamics of the hierarchical network: as the stimulus comes on, sensory evidence is gradually accumulated, until the integration circuit reaches a categorization state that is then maintained by the reverberant activity of the local circuit²⁷ as well as across the hierarchy. As this categorization state is much less sensitive to sensory fluctuations, bottom-up contributions to CP are confined to the initial accumulation phase and decay over time. Top-down contributions show a complementary rising time-course because they reflect the build-up and maintenance of the decision. When the two contributions have comparable magnitude, CP can exhibit a sustained time-course for a wide range of decision dynamics as observed in the classical motion discrimination task². The model, however, predicts that if the magnitude of the two contributions is different, CP should show a non-sustained time-course. We implemented the integration circuit using a spiking attractor network model²⁷. However, the decomposition of CP into complementary bottom-up and top-down contributions can be obtained with a broad family of decision models, including a firing rate attractor network model^33,38 or a linear integrator with absorbing bounds in combination with a post-decision feedback signal, but not with a leaky²⁹ or a perfectly linear integrator (no bounds).

We used trial-to-trial stimulus fluctuations (‘non-replicate’ condition) as one way to generate bottom-up correlations, that is, those generated in the absence of top-down connections, and to establish predictions that could be easily tested experimentally. While previous studies concluded that the impact of stimulus fluctuations on CP and neuronal variability was small or negligible^10,25,36,39, we found that non-replicate RDKs caused a substantial increase in spike count Fano factor and correlations compared with replicate RDKs (they accounted for ~35% of the correlations for coherences in the range 0–50% in two pairs of similarly tuned cells, T=100 ms). The differences with previous reports could be due to our smaller spike count windows^25,36 and lower coherences³⁹. We then compared CPs in MT for replicate and non-replicate stimuli and verified a key model prediction: while replicate stimuli caused a sustained decrease in correlations compared with non-replicate, they generated a decrease of CP only early and not late in the trial.

Strong local excitatory connections in combination with strong and global inhibition in the sensory circuit gave rise to competitive dynamics between the populations E1 and E2. These local dynamics conferred the spike count correlations caused by non-replicate stimuli or by local background inputs a distinctive structure: pairs within one population (EiEi) were positively correlated while mixed pairs (EiEj) were negatively correlated, yielding a near zero average across all pairs. Although similarly tuned MT neurons commonly exhibit larger correlations than oppositely tuned ones^13,14,39, only the correlations found in the medial superior temporal cortex (MST) of trained monkeys resemble the structure produced by the model, with negative correlations among pairs with opposed tuning and near zero average correlations across all pairs.⁴⁰ Because in our network CP depends on the difference in average correlations within (EiEi) and across populations (EiEj)¹², the decomposition into bottom-up and top-down components is invariant to changes in the correlation structure that maintain this difference (for example, an increase in the average correlation across all pairs). Bottom-up correlations caused by other types of stochastic network dynamics such as switches between multiple discrete attractors^41,42 or diffusion in a continuous line attractor^43,44,45 would in general also contribute to the early component of CP. Bottom-up correlations with a very slow time-scale produced by intrinsic network dynamics would generate CP that rises and decays slowly, and they could eventually cause above chance CP before stimulus onset⁴⁶. In sum, even if our analysis cannot identify the precise origin of bottom-up correlations, it succeeds in showing that any mechanism causing bottom-up fast correlations leads to a similar fast-rise-and-decay CP time-course.

Top-down inputs carrying decision-related or attentional signals also contribute to CP^9,17 and correlations^14,20. We designed a model-driven analysis that, based on the stability of the single cell CP traces, provided evidence for a late top-down contribution to CP. We defined the CP correlation as a measure of the stability of individual CP traces: early CP produced by bottom-up stimulus fluctuations caused temporally varying traces that crossed frequently yielding low CP correlation. Late in the stimulus, when CP is presumably dominated by slower top-down inputs, CP traces were more stable yielding higher CP correlation. Moreover, we found that lagged correlations between MT pairs showed a rising time-course during the stimulus that was predictive of the magnitude of late CP across cell pairs. Together, these findings are consistent with the presence of a top-down signal that, towards the end of each stimulus, selectively boosts the rate of a subset of MT cells aligned with the upcoming choice. Our analysis suggests why previous reports^25,39 might have missed lagged correlations: they are weak, confined to the late part of the stimulus and could be selectively expressed by a subpopulation of neurons receiving top-down inputs, as in attentional studies in V1 (ref. 20).

The impact of noise correlations on how accurately populations of neurons represent sensory information has typically been assessed in settings where tuning curves and correlation structure are treated as independent variables and where the effect of the dynamics of the recurrent circuitry has been mostly unexplored^47,48,49. Our approach is different in that our network function is not to accurately represent the stimulus, but rather to perform a binary categorization. This computation is implemented using strongly recurrent connectivity that induces competitive dynamics²⁷. Competition is crucial for the categorization in the integration circuit, but it also emerges in the sensory circuit via top-down feedback and fluctuations in the external inputs affecting both the tuning and the correlation structure. In particular, competition induces negative noise correlations across the two oppositely tuned sensory populations, a condition that impairs discrimination accuracy^11,50. Understanding how variability ultimately constrains function will require a systematic characterization of the relation between connectivity, dynamics and correlations in networks designed to carry out specific computations^51,52.

Our analyses support the idea that late CP in MT cells reflects the impact of top-down inputs on sensory evoked responses^8,9,15,17. Top-down inputs in this context have been interpreted as selective attentional signals¹⁹ whose allocation varies on multiple time-scales¹⁸, introducing co-variability in sensory populations^14,20 and biasing perceptual decisions^17,18. The specific nature and dynamics of these top-down inputs remains largely unknown. They include pre-stimulus expectation signals⁵³, reflecting the animal’s ‘guess’ about the upcoming stimulus, or attention signals recruited by the sensory activity produced by salient fluctuations at the beginning of the stimulus. Both of these signals bias sensory activity towards one choice before the subject commits to a decision⁹. Alternatively, they could be post-decision signals, reinforcing the representation of the chosen percept^8,9 or reflecting the motor response plan developing slowly during the stimulus interval⁵⁴. A pure post-decision signal could be implemented by replacing the integration circuit in our network with a linear integrator with absorbing bounds that sends a feedback signal to the sensory circuit once the bound is reached. Such a phenomenological model would yield similar bottom-up and a top-down CP components as our model. However, the top-down signals in our model are not purely post-decision but combine previous proposals of pre-decision and post-decision top-down signals⁹. Because the sensory and integration circuits are recurrently coupled and evolve dynamically in parallel, feedback signals are effectively recruited by activity of sensory circuits and impact the final decision by altering the dynamics during both the accumulation and the categorization maintenance periods⁵⁵. As a consequence, the top-down contribution to CP cannot be called ‘non-causal’ as would be the case for a post-decision signal. The differences between our mechanistic model and the phenomenological integration-to-bound plus feedback model are revealed after the first crossing of the categorization bound: while in our model it is easy to investigate the dynamics of changes of mind, caused for instance by stimuli with time-varying coherence⁵⁶ or by external stimulation⁵⁷, the phenomenological model requires further ad-hoc assumptions about the probability to escape the absorbing bound⁵⁸ and about when to switch the top-down input.

During fixed-duration motion discrimination task subjects categorize RDKs and settle on one alternative before the end of the stimulus, ignoring late evidence and performing suboptimally^29,59. Perception appears categorical even for low-coherence RDKs in a task that requires taking into account the probabilities of both motion alternatives to perform optimally⁶⁰. A recent study has proposed that transient evidence integration may arise from approximate inference using sequential neural sampling⁶¹. Attractor dynamics produced by recurrent lateral connections within integration circuits are a plausible mechanistic implementation of perceptual categorization²⁷. Recurrent feedback through top-down connections in our network strengthened this categorization dynamics and increased its stability at the cost of decreased accuracy. Although in our setting these two mechanisms seem to differ only quantitatively, the distinctive features of a top-down implementation would be revealed in tasks requiring the binding of several stimulus attributes (for example, motion and disparity), such as in the discrimination of the direction of rotation of a perceptually bistable cylinder⁸. Our hierarchical model pioneers the analysis of recurrent dynamics in bottom-up/top-down loops, a mechanism that may play a crucial role in the perceptual categorization of more complex stimuli.

Methods

Network model

The network model (Fig. 1a) consists of a sensory circuit reciprocally connected to an integration circuit. A standardized description of the model and all simulation parameters can be found in Supplementary Tables 1–4 (ref. 62).

The sensory circuit is a balanced randomly connected EI-network^23,30,31 (connection probability p=0.2) with 1,600 excitatory (E) and 400 inhibitory (I) leaky integrate-and-fire neurons. Model equations and parameter values are mostly taken from ref. 23. Synaptic transmission mimics AMPA and GABA_A receptor conductance dynamics²³ (mean efficacies =0.76 nS, =1.52 nS and 12.6 nS). Excitatory neurons are divided into two symmetric populations, E1 and E2, preferring opposite directions of motion. Connections within each population (E_i to E_i with i=1, 2) are stronger (potentiating factor w₊=1.3) than connections across populations (E_i to E_j with i≠j; weakening factor w₋=0.7), capturing the stronger coupling among cells with similar direction preference. The stimulus is modelled as a time-varying input current into each neuron k in population β=E1 or E2 (see details below). Neurons in E1 and E2 also receive weak top-down connections from populations D1 and D2 of the integration circuit, respectively (Fig. 1a; connection probability p_FB=0.2; synaptic weight g_FB=0.0668, nS·b_FB, with b_FB the dimensionless feedback strength that takes values in the range 0-6). All sensory neurons receive AMPA-like random connections from a external population (X) composed of 1,000 cells firing independent Poisson spike trains at a constant rate ν_ext=12.5 sp s⁻¹ (connection probability p_x=0.32; efficacy 1.71 nS; see Supplementary Fig. 2a for a variant with local background inputs). Spike trains from X cells were always different across trials. Thus, EiEi pairs share on average a fraction of 0.2, 0.32 and 0.2 of their recurrent, external and top-down inputs, respectively.

The integration circuit is a biophysical network model of decision-related activity in LIP²⁷, whose dynamics have been extensively studied^33,38. It contains 1,600 excitatory and 400 inhibitory leaky integrate-and-fire neurons, that are all-to-all connected. There are three populations of excitatory cells: D1 and D2 (240 cells each) represent the two choices, and the non-specific population Dn contains the rest of the E cells. Synaptic transmission mimics AMPA, NMDA and GABA_A receptor conductance dynamics²⁷ (efficacies , , , , , and ). Recurrent connections within D1 and D2 are stronger (factor w₊=1.6) than connections within Dn. Cells in D1 and D2 receive the sensory evidence via feedforward AMPA connections from neurons in E1 and E2, respectively (Fig. 1a; connection probability p_FF=0.2; efficacy ). Each neuron in the integration circuit receives an external independent Poisson spike train via AMPA synapses (rate 2,392 sp s⁻¹ to D1 and D2 and 2,400 sp s⁻¹ to Dn and I; efficacies , ). External spike trains were always different across trials.

Network dynamics. Although sensory and integration circuits followed the same connectivity scheme (Fig. 1a), the connectivity strengths differed such that they exhibited different dynamics. The sensory circuit generated weak competition between E1 and E2 allowing the network to operate in an approximately linear regime in which each population rate can track the stimulus input (Fig. 1b). This weak competition shaped the structure of correlations³⁴ generated by non-replicate stimuli (Fig. 2c), local background inputs (Supplementary Fig. 3c) and top-down signals (Fig. 2f): correlations between pairs within the same population (EiEi) are positive, whereas in mixed pairs (EiEj) they are negative. The average correlation across all pairs was near zero, a robust feature of balanced networks resulting from the dynamic balance of excitation and inhibition²³. In contrast, on stimulus presentation, the integration circuit exhibited non-linear dynamics as a consequence of strong winner-take-all competition and the existence of two attractors representing the two possible choices^27,33,38.

Stimulus model. The stimulus-driven input was modelled as an afferent current into each sensory neuron k in population β=(E1, E2):

with I₀=0.08 nA, the mean input for zero-coherence stimuli. The term s^β(t) is the stimulus, representing sensory evidence for motion in the β-direction. It is common to all neurons of population β and independent between the two populations (except in Supplementary Fig. 6). The term is independently generated for each neuron k, mimicking heterogeneity in the afferent input. Figure 1b, shows an example of s^E1(t) and s^E2(t) (bottom traces) and Supplementary Fig. 1b,c compares s^β(t) and for two neurons (top traces). The two terms are given by

and

where c is the stimulus coherence and γ^β the average additional input at highest coherence c=1. Without loss of generality, we assume that the stimulus is moving in the preferred direction of E1 neurons, that is, we use a positive γ^E1 and a negative γ^E2 with γ^E2=−γ^E1, so that the firing rates of E1 (E2) neurons increase (decrease) approximately linearly with c, as observed experimentally³⁶. Temporal modulations in sensory input generated by the specific realization of the dot trajectories in RDKs are captured by the time-varying terms z^β(t) and (independent Ornstein-Uhlenbeck processes with zero mean, s.d. equal one and time constant τ_stim=20 ms). The amplitude of the temporal modulations is set by σ_stim=σ_ind,k=0.212 σ, where σ is the dimensionless strength of stimulus modulations (except when otherwise indicated).

Repeated presentations of the stimulus over trials were done in two ways: (1) in the replicate stimulus condition we injected the exact same realization of stimulus currents into each cell in every trial, (2) in the non-replicate stimulus condition we injected different realizations of the currents in every trial, with different realization of both s^β(t) and , what caused trial-to-trial fluctuations in the stimulus input. We modelled the stimulus as an injected current instead of a barrage of pre-synaptic spikes so that sensory input per se did not constitute an uncontrolled source of variability in the replicate stimulus condition.

Simulation details

The network model was implemented in Python using the Brian simulator version 1.4 (ref. 63). The network model code is available at ModelDB (https://senselab.med.yale.edu/ModelDB/). We used the Euler integration method with a time step of 0.1 ms. We simulated fixed-duration trials with a stimulus duration of 2 s, as in experimental settings^2,9. Stimulus presentation was preceded by a 3-s interval to prevent transient effects due to initial conditions. The choice outcome of the network was determined by the population of the integration circuit (D1 or D2) with a higher population firing rate over the last 50 ms of the stimulus period. Results for a given parameter set are based on 2,000 repeated trials of the same network (same connectivity matrix) with random initial conditions as well as different realizations of the external background inputs into each circuit.

For the replicate stimulus condition (see above) we generated 100 distinct realizations of replicate stimuli and presented each of them over 100 repeated trials (in total 10,000 trials). Replicate stimuli that led to overly consistent responses (>95 % choices in one direction) were excluded from the analysis because there were too few trials yielding one of the choices to have a good estimate of CP (this was the case for 23 of the 100 replicate stimuli).

Slower decision dynamics (Fig. 3c) was realized by decreasing the efficacy of feedforward connections from the sensory to the integration circuit by 25% and increasing the temporal modulations of the stimulus (σ=1.33). This led to a longer ‘sensory integration window’ (1.057 s versus 0.675 s; see below).

The heterogeneous network (Figs 5 and 6 and Supplementary Figs 9-10) is identical to the homogeneous network, but not all sensory neurons receive stimulus and top-down inputs. We randomly split each sensory population in four neural groups of equal size that receive (1) both stimulus and top-down feedback inputs (S+FB+), (2) only stimulus (S+FB−), (3) only top-down (S−FB+) and (4) neither stimulus nor top-down (S−FB−). This was achieved by using an individual feedback strength and an individual amplitude of input modulations σ_{stim, k} for each neuron k. We set (no top-down) for neurons in FB− and (strong top-down) for neurons in FB+. We set σ_stim,k=0 and for neurons in S−, and and σ_ind,k=0 for neurons in S+. This yields different (identical) input currents into each cell in S− (S+), without changing the s.d. of input currents compared with the homogeneous network (see equation (1)). All other parameters were left unchanged.

Physiological data

Physiological recordings had previously been obtained by K.H. Britten, J.A. Movshon, W.T. Newsome, M.N. Shadlen and E. Zohary. Experimental details are described in refs 2, 13, 64. In brief, three adult macaque monkeys (Macaca mulatta, two male and one female) performed a fixed-duration motion direction discrimination task near psychophysical threshold while responses of single neurons^2,64 or pairs¹³ in MT/V5 were recorded. The stimuli, RDKs at various motion coherences, were matched to each neuron’s preference for stimulus size, speed and motion direction. The precise pattern of random dots of the kinematograms at each coherence was either different (non-replicate RDK) or the same across trials (replicate RDK). The experimental data sets are available in the Neural Signal Archive (www.neuralsignal.org; single units: nsa2004.1 and nsa2009.1; paired units: nsa2004.2 and nsa2012.1). Most single units were recorded either in the non-replicate or replicate condition, but a subset of 22 neurons was recorded under both conditions. For these neurons, the impact of stimulus fluctuations on Fano factor and CP was consistent with the data shown in Fig. 4a,c: the shift-corrected Fano factor was higher in the non-replicate than in the replicate condition, and CP was higher in the non-replicate compared with the replicate condition early (P=0.05), but not late (P=0.70) for a count window T=250 ms. Paired units were only recorded in the non-replicate condition, except for two neural pairs (emu034 and emu035) that were obtained for both replicate and non-replicate stimuli (Fig. 4b and Supplementary Fig. 8b).

We used recordings from two monkeys (E and W) and excluded data from a third monkey (J) because the average CPs (T=2 s) obtained in this monkey were only marginally above chance level² and were significantly smaller than for the other two monkeys (one-way ANOVA, F_(2,250)=6.71, P=0.0015; mean CP was 0.565±0.010 for monkey E with n=117, 0.561±0.012 for monkey W with n=67, and 0.509±0.012 for monkey J with n=67). Monkey J also had considerably higher psychophysical and neuronal thresholds than E and W⁶⁴. To be included in the analysis, neurons had to fulfil the following criteria: (1) more than 20 trials are available for the zero-coherence condition (2) in these trials there are at least five preferred and five non-preferred choices, (3) the average firing rate is higher than 1 sp s⁻¹, and (4) the neuron fired at least 100 spikes across all trials. Outlier trials in which the spike count deviated from the mean by more than 3 s.d. were excluded. Neurons from the paired-unit data set nsa2004.2 whose preferred direction differed by <35° were included in the single unit analysis (47 neurons from monkey E, all with non-replicate stimuli). This ensured that the direction of the stimulus did not differ from the preferred orientation of the neurons, causing a decrease in the magnitude of CP¹⁰. For the analysis of pairwise correlations (Fig. 6) we used 32 pairs (all from monkey E) whose preferred directions differed by <90°.

Spike count statistics and choice probabilities

After binning time using dt=1 ms, the spike train of neuron k in trial l is represented as a binary word that equals 1 if there is a spike in the interval (t, t+dt) and zero otherwise. The instantaneous spike count of neuron k in trial l over a count window (t–T/2, t+T/2) is defined as:

that is the discrete convolution of with the kernel K_T(t) which equals one in (–T/2, T/2) and zero otherwise.

The individual trial-averaged rate of neuron k (Supplementary Figs 1a-c and 8a) is defined as where the brackets represent the average over trials and T=50 ms. The instantaneous activity of population β in trial l (Figs 1b and 7a,b) is defined as with the sum running over the N_β cells of population β (T=50 ms). The population rate averaged over the trials yielding choice α (with α=1,2) is defined as . We label r_β,α(t) as preferred and non-preferred for α=β and α≠β, respectively (Fig. 2a,d and Supplementary Fig. 3a).

The instantaneous CP CP_k(t;T) of neuron k is obtained by classifying the spike counts across trials according to the choice yielded in each trial (that is, versus ). The CP_k(t; T) is defined as the area under the receiver operating characteristic curve obtained from these two distributions².

The spike count Fano factor for neuron k (Fig. 4a) is defined as the ratio of the spike count variance to the mean spike count:

where the variance is obtained over trials.

Spike count noise correlations were measured using the Pearson correlation coefficient of the spike counts of neuron k and k′ at times t and t′:

with the covariance and the variance obtained across trials (we have dropped the explicit dependence on T to ease the notation). The average over the population of pairs (k, k′) was denoted as ρ(t, t′) (Fig. 6a–g). We defined the instantaneous (non-lagged) correlation as . The correlation matrices ρ (t_i, t_j) in Fig. 6a–c were obtained by evaluating ρ(t, t′) at t_i=(i−1/2) T and t_j=(j−1/2) T, with i, j=1…8 and T=250 ms. Finally, we represented the diagonals of ρ(t_i, t_j), defined as constant, versus the time lag t_i–t_j as a way to visualize an instantaneous cross-correlogram (Fig. 6a,b insets).

To remove a potential influence of differences in the average spike count on the measured Fano factor⁶⁵, correlation and the CP^12,66, we used adjusted count windows of variable length to compute FF(t), ρ_kk′(t) and CP(t) (Fig. 4a–c). The spike count window for each cell k centred at each time point t was adjusted to (t−T′/2, t+T′/2) to contain exactly n_k spikes on average (across trials). The number n_k=r_kT, where r_k is the trial-averaged rate over the stimulus duration (2 s). The Fano factors FF(t) and correlations ρ_kk′(t) were very similar for fixed and adjusted count windows, as was the CP(t) in the non-replicate condition. For the replicate condition, where the trial-averaged rate shows strong temporal modulation²⁸ (see Supplementary Fig. 8a), CP(t) was smoother for adjusted windows. The finding that early CP(t) decreases in the replicate condition does not depend on the count window T (Fig. 4c) or whether we used fixed or adjusted count windows.

For the network model, we averaged CP_k(t; T) and ρ_kk′(t_i, t_j) over 100 randomly chosen neurons from populations E1 and E2 (or over all the pairs formed by these neurons) with a minimum firing rate of 1 sp s⁻¹. For the experimental data we averaged CP_k(t; T), FF_k(t; T) and ρ_kk′(t, t′) over a variable number of neurons and pairs (see legends of Figs 4, 5, 6 and Supplementary Fig. 8). Data analysis was restricted to trials with zero-coherence stimuli, except for the correlation measurements. Correlations ρ_kk′(t) of single pairs (Fig. 4b and Supplementary Fig. 8b) were calculated separately for the available stimulus coherences ranging from –51.2% to +51.2% and then averaged (negative coherences represent motion in the non-preferred direction). Correlation matrices ρ(t_i, t_j) (Fig. 6) were obtained using low motion coherences (–3.2, 0 and +3.2%) that yielded a comparable number of trials for each choice. This gave a total of n=64 conditions from the 32 cell pairs.

When analysing the experimental data, we modified equations (4) and (5) to obtain Fano factors and correlations to remove the impact of slow variations in firing rate across trials²⁵. We used the shift-corrected spike count covariance and variance defined as:

and

Results for Fano factors are shown using this correction. For spike count correlations we did not use the correction because it did not affect the estimation at small spike count window T (<250 ms) but yielded larger estimation errors for large T. Results for Fano factors and correlations did not qualitatively change with the application of the shift correction.

In the analysis of the dependence of CPs, Fano factors and correlations on T (Fig. 4, Supplementary Figs 4 and 8), we averaged CP_k(t; T), FF_k(t; T) and ρ_kk′(t) across the time points t=T/2, 3/2 T, 5/2 T, … so that the statistics come from non-overlapping spike count windows starting at stimulus onset (t=0). We did this for T=15, 30, 60, 125, 250, 500, 1,000 and 2,000 ms.

The CP correlation matrix (Fig. 5 and Supplementary Fig. 9) is defined as the Spearman’s rank correlation coefficient across neurons of the CP measured at two different time bins:

with CP_k(t; T) evaluated at times t_i=(i−1/2) T and t_j=(j−1/2) T, with i, j=1…8 and T=250 ms. The matrix C(t_i, t_j) was obtained for the network (Fig. 5b,d and Supplementary Fig. 9) using a similar number of neurons and trials as available in the MT data. We selected 160 neurons (40 of each group) and computed their CP time-courses based on n randomly selected trials (with n=100 in Fig. 5b,d; n=100, 200 and 2,000 trials in Supplementary Fig. 9). We calculated C(t_i, t_j) as the average across 1,000 different selection of trials from which s.e. values were obtained. All data analyses were carried out in MATLAB (The Mathworks).

Psychophysical data

Two adult macaque monkeys (Macaca mulatta, male) were trained to report with a reaching response the motion direction of a random dot kinematogram (RDK, see Supplementary Methods) along the horizontal axis with varying levels of motion coherence. On each trial we recorded both the monkey’s choice and the presented stimulus (that is, the dots positions in each frame). These data were used to compute average motion energy traces (Supplementary Fig. 7). The task was very similar to the classical fixed-duration version^2,13,64 (see Supplementary Methods for details). All surgical and behavioural procedures conformed to guidelines established by the National Institutes of Health and were approved by the Institutional Animal Care and Use Committee of Stanford University.

Psychophysical reverse correlation

We used psychophysical reverse correlation^9,67 to measure the amplitude and time-course of the impact of stimulus fluctuations on the decision. The psychophysical kernels were computed as the difference of the average stimulus leading to each of the two possible choices. For the experimental data, stimulus fluctuations were estimated by computing the motion energy contained in the RDKs using appropriate spatio-temporal filters^29,68. For details, see Supplementary Methods.

Additional information

How to cite this article: Wimmer, K. et al. Sensory integration dynamics in a hierarchical network explains choice probabilities in cortical area MT. Nat. Commun. 6:6177 doi: 10.1038/ncomms7177 (2015).

References

Gold, J. I. & Shadlen, M. N. The neural basis of decision making. Annu. Rev. Neurosci. 30, 535–574 (2007).
Article CAS Google Scholar
Britten, K. H., Newsome, W. T., Shadlen, M. N., Celebrini, S. & Movshon, J. A. A relationship between behavioral choice and the visual responses of neurons in macaque MT. Vis. Neurosci. 13, 87–100 (1996).
Article CAS Google Scholar
Nienborg, H., Cohen, M. R. & Cumming, B. G. Decision-related activity in sensory neurons: correlations among neurons and with behavior. Annu. Rev. Neurosci. 35, 463–483 (2012).
Article CAS Google Scholar
Smith, J. E. T., Masse, N. Y., Zhan, C. A. & Cook, E. P. inVisual Cortex—Current Status and Perspectives eds Molotchnikoff S., Rouat J. 161–184InTech (2012).
Sasaki, R. & Uka, T. Dynamic readout of behaviorally relevant signals from area MT during task switching. Neuron 62, 147–157 (2009).
Article CAS Google Scholar
Gu, Y., Angelaki, D. E. & Deangelis, G. C. Neural correlates of multisensory cue integration in macaque MSTd. Nat. Neurosci. 11, 1201–1210 (2008).
Article CAS Google Scholar
Uka, T. & DeAngelis, G. C. Contribution of area MT to stereoscopic depth perception: choice-related response modulations reflect task strategy. Neuron 42, 297–310 (2004).
Article CAS Google Scholar
Dodd, J. V., Krug, K., Cumming, B. G. & Parker, A. J. Perceptually bistable three-dimensional figures evoke high choice probabilities in cortical area MT. J. Neurosci. 21, 4809–4821 (2001).
Article CAS Google Scholar
Nienborg, H. & Cumming, B. G. Decision-related activity in sensory neurons reflects more than a neuron’s causal effect. Nature 459, 89–92 (2009).
Article ADS CAS Google Scholar
Cohen, M. R. & Newsome, W. T. Estimates of the contribution of single neurons to perception depend on timescale and noise correlation. J. Neurosci. 29, 6635–6648 (2009).
Article CAS Google Scholar
Shadlen, M. N., Britten, K. H., Newsome, W. T. & Movshon, J. A. A computational analysis of the relationship between neuronal and behavioral responses to visual motion. J. Neurosci. 16, 1486–1510 (1996).
Article CAS Google Scholar
Haefner, R. M., Gerwinn, S., Macke, J. H. & Bethge, M. Inferring decoding strategies from choice probabilities in the presence of correlated variability. Nat. Neurosci. 16, 235–242 (2013).
Article CAS Google Scholar
Zohary, E., Shadlen, M. N. & Newsome, W. T. Correlated neuronal discharge rate and its implications for psychophysical performance. Nature 370, 140–143 (1994).
Article ADS CAS Google Scholar
Cohen, M. R. & Newsome, W. T. Context-dependent changes in functional circuitry in visual area MT. Neuron 60, 162–173 (2008).
Article CAS Google Scholar
Krug, K. A common neuronal code for perceptual processes in visual cortex? Comparing choice and attentional correlates in V5/MT. Philos. Trans. R Soc. Lond. B Biol. Sci. 359, 929–941 (2004).
Article Google Scholar
Herrington, T. M. & Assad, J. A. Neural activity in the middle temporal area and lateral intraparietal area during endogenously cued shifts of attention. J. Neurosci. 29, 14160–14176 (2009).
Article CAS Google Scholar
Roelfsema, P. R. & Spekreijse, H. The representation of erroneously perceived stimuli in the primary visual cortex. Neuron 31, 853–863 (2001).
Article CAS Google Scholar
Cohen, M. R. & Maunsell, J. H. R. A neuronal population measure of attention predicts behavioral performance on individual trials. J. Neurosci. 30, 15241–15253 (2010).
Article CAS Google Scholar
Treue, S. & Martinez-Trujillo, J. C. Feature-based attention influences motion processing gain in macaque visual cortex. Nature 399, 575–579 (1999).
Article ADS CAS Google Scholar
Roelfsema, P. R., Lamme, V. A. F. & Spekreijse, H. Synchrony and covariation of firing rates in the primary visual cortex during contour grouping. Nat. Neurosci. 7, 982–991 (2004).
Article CAS Google Scholar
Cohen, M. R. & Maunsell, J. H. R. Attention improves performance primarily by reducing interneuronal correlations. Nat. Neurosci. 12, 1594–1600 (2009).
Article CAS Google Scholar
Mitchell, J. F., Sundberg, K. A. & Reynolds, J. H. Spatial attention decorrelates intrinsic activity fluctuations in macaque area V4. Neuron 63, 879–888 (2009).
Article CAS Google Scholar
Renart, A. et al. The asynchronous state in cortical circuits. Science 327, 587–590 (2010).
Article ADS CAS Google Scholar
Bair, W. & O’Keefe, L. P. The influence of fixational eye movements on the response of neurons in area MT of the macaque. Vis. Neurosci. 15, 779–786 (1998).
Article CAS Google Scholar
Bair, W., Zohary, E. & Newsome, W. T. Correlated firing in macaque visual area MT: time scales and relationship to behavior. J. Neurosci. 21, 1676–1697 (2001).
Article CAS Google Scholar
Kohn, A. & Smith, M. A. Stimulus dependence of neuronal correlation in primary visual cortex of the macaque. J. Neurosci. 25, 3661–3673 (2005).
Article CAS Google Scholar
Wang, X.-J. Probabilistic decision making by slow reverberation in cortical circuits. Neuron 36, 955–968 (2002).
Article CAS Google Scholar
Bair, W. & Koch, C. Temporal precision of spike trains in extrastriate cortex of the behaving macaque monkey. Neural Comput. 8, 1185–1202 (1996).
Article CAS Google Scholar
Kiani, R., Hanks, T. D. & Shadlen, M. N. Bounded integration in parietal cortex underlies decisions even when viewing duration is dictated by the environment. J. Neurosci. 28, 3017–3029 (2008).
Article CAS Google Scholar
Amit, D. J. & Brunel, N. Model of global spontaneous activity and local structured activity during delay periods in the cerebral cortex. Cereb. Cortex N. Y. N 1991 7, 237–252 (1997).
CAS Google Scholar
Vreeswijk, C. van & Sompolinsky, H. Chaos in neuronal networks with balanced excitatory and inhibitory activity. Science 274, 1724–1726 (1996).
Article ADS Google Scholar
Shadlen, M. N. & Newsome, W. T. The variable discharge of cortical neurons: implications for connectivity, computation, and information coding. J. Neurosci. 18, 3870–3896 (1998).
Article CAS Google Scholar
Roxin, A. & Ledberg, A. Neurobiological models of two-choice decision making can be reduced to a one-dimensional nonlinear diffusion equation. PLoS. Comput. Biol. 4, e1000046 (2008).
Article ADS MathSciNet Google Scholar
Polk, A., Litwin-Kumar, A. & Doiron, B. Correlated neural variability in persistent state networks. Proc. Natl Acad. Sci. USA 109, 6295–6300 (2012).
Article ADS CAS Google Scholar
Cain, N. & Shea-Brown, E. Impact of correlated neural activity on decision-making performance. Neural Comput. 25, 289–327 (2013).
Article MathSciNet Google Scholar
Britten, K. H., Shadlen, M. N., Newsome, W. T. & Movshon, J. A. Responses of neurons in macaque MT to stochastic motion signals. Vis. Neurosci. 10, 1157–1169 (1993).
Article CAS Google Scholar
Kiani, R. & Shadlen, M. N. Representation of confidence associated with a decision by neurons in the parietal cortex. Science 324, 759–764 (2009).
Article ADS CAS Google Scholar
Wong, K.-F. & Wang, X.-J. A recurrent network mechanism of time integration in perceptual decisions. J. Neurosci. 26, 1314–1328 (2006).
Article CAS Google Scholar
Huang, X. & Lisberger, S. G. Noise correlations in cortical area MT and their potential impact on trial-by-trial variation in the direction and speed of smooth-pursuit eye movements. J. Neurophysiol. 101, 3012–3030 (2009).
Article Google Scholar
Gu, Y. et al. Perceptual learning reduces interneuronal correlations in macaque visual cortex. Neuron 71, 750–761 (2011).
Article CAS Google Scholar
Litwin-Kumar, A. & Doiron, B. Slow dynamics and high variability in balanced cortical networks with clustered connections. Nat. Neurosci. 15, 1498–1505 (2012).
Article CAS Google Scholar
Deco, G. & Hugues, E. Neural network mechanisms underlying stimulus driven variability reduction. PLoS Comput. Biol. 8, e1002395 (2012).
Article ADS MathSciNet CAS Google Scholar
Ponce-Alvarez, A., Thiele, A., Albright, T. D., Stoner, G. R. & Deco, G. Stimulus-dependent variability and noise correlations in cortical MT neurons. Proc. Natl Acad. Sci. USA 110, 13162–13167 (2013).
Article ADS CAS Google Scholar
Wimmer, K., Nykamp, D. Q., Constantinidis, C. & Compte, A. Bump attractor dynamics in prefrontal cortex explains behavioral precision in spatial working memory. Nat. Neurosci. 17, 431–439 (2014).
Article CAS Google Scholar
Ben-Yishai, R., Bar-Or, R. L. & Sompolinsky, H. Theory of orientation tuning in visual cortex. Proc. Natl Acad. Sci. USA 92, 3844–3848 (1995).
Article ADS CAS Google Scholar
Rolls, E. T. & Deco, G. Prediction of decisions from noise in the brain before the evidence is provided. Front Neurosci. 5, 33 (2011).
Article Google Scholar
Abbott, L. F. & Dayan, P. The effect of correlated variability on the accuracy of a population code. Neural Comput. 11, 91–101 (1999).
Article CAS Google Scholar
Sompolinsky, H., Yoon, H., Kang, K. & Shamir, M. Population coding in neuronal systems with correlated noise. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 64, 051904 (2001).
Article ADS CAS Google Scholar
Ecker, A. S., Berens, P., Tolias, A. S. & Bethge, M. The effect of noise correlations in populations of diversely tuned neurons. J. Neurosci. 31, 14272–14283 (2011).
Article CAS Google Scholar
Averbeck, B. B., Latham, P. E. & Pouget, A. Neural correlations, population coding and computation. Nat. Rev. Neurosci. 7, 358–366 (2006).
Article CAS Google Scholar
Beck, J., Bejjanki, V. R. & Pouget, A. Insights from a simple expression for linear fisher information in a recurrently connected population of spiking neurons. Neural Comput. 23, 1484–1502 (2011).
Article MathSciNet Google Scholar
Tkacik, G., Prentice, J. S., Balasubramanian, V. & Schneidman, E. Optimal population coding by noisy spiking neurons. Proc. Natl Acad. Sci. USA 107, 14419–14424 (2010).
Article ADS CAS Google Scholar
Gold, J. I., Law, C.-T., Connolly, P. & Bennur, S. The relative influences of priors and sensory evidence on an oculomotor decision variable during perceptual learning. J. Neurophysiol. 100, 2653–2668 (2008).
Article Google Scholar
Donner, T. H., Siegel, M., Fries, P. & Engel, A. K. Buildup of choice-predictive activity in human motor cortex during perceptual decision making. Curr. Biol. 19, 1581–1585 (2009).
Article CAS Google Scholar
Pooresmaeili, A., Poort, J. & Roelfsema, P. R. Simultaneous selection by object-based attention in visual and frontal cortex. Proc. Natl Acad. Sci. USA 111, 6467–6472 (2014).
Article ADS CAS Google Scholar
Huk, A. C. & Shadlen, M. N. Neural activity in macaque parietal cortex reflects temporal integration of visual motion signals during perceptual decision making. J. Neurosci. 25, 10420–10436 (2005).
Article CAS Google Scholar
Hanks, T. D. et al. Distinct relationships of parietal and prefrontal cortices to evidence accumulation. Nature doi:10.1038/nature14066 (2015).
Resulaj, A., Kiani, R., Wolpert, D. M. & Shadlen, M. N. Changes of mind in decision-making. Nature 461, 263–266 (2009).
Article ADS CAS Google Scholar
Drugowitsch, J., Moreno-Bote, R., Churchland, A. K., Shadlen, M. N. & Pouget, A. The cost of accumulating evidence in perceptual decision making. J. Neurosci. 32, 3612–3628 (2012).
Article CAS Google Scholar
Fleming, S. M., Maloney, L. T. & Daw, N. D. The irrationality of categorical perception. J. Neurosci. 33, 19060–19070 (2013).
Article CAS Google Scholar
Haefner, R. M., Berkes, P. & Fiser, J. Perceptual decision-making as probabilistic inference by neural sampling. Preprint at http://arxiv.org/abs/1409.0257 (2014).
Nordlie, E., Gewaltig, M.-O. & Plesser, H. E. Towards reproducible descriptions of neuronal network models. PLoS Comput. Biol. 5, e1000456 (2009).
Article MathSciNet Google Scholar
Goodman, D. F. M. & Brette, R. The Brian simulator. Front Neurosci. 3, 192–197 (2009).
Article Google Scholar
Britten, K. H., Shadlen, M. N., Newsome, W. T. & Movshon, J. A. The analysis of visual motion: a comparison of neuronal and psychophysical performance. J. Neurosci. 12, 4745–4765 (1992).
Article CAS Google Scholar
Churchland, M. M. et al. Stimulus onset quenches neural variability: a widespread cortical phenomenon. Nat. Neurosci. 13, 369–378 (2010).
Article CAS Google Scholar
Kang, I. & Maunsell, J. H. R. Potential confounds in estimating trial-to-trial correlations between neuronal response and behavior using choice probabilities. J. Neurophysiol. 108, 3403–3415 (2012).
Article Google Scholar
Neri, P. & Levi, D. M. Receptive versus perceptive fields from the reverse-correlation viewpoint. Vision Res. 46, 2465–2474 (2006).
Article Google Scholar
Adelson, E. H. & Bergen, J. R. Spatiotemporal energy models for the perception of motion. J. Opt. Soc. Am. A 2, 284–299 (1985).
Article ADS CAS Google Scholar
Rocha, J., de la, Doiron, B., Shea-Brown, E., Josić, K. & Reyes, A. Correlation between neural spike trains increases with firing rate. Nature 448, 802–806 (2007).
Article ADS Google Scholar

Download references

Acknowledgements

We thank K.H. Britten, J.A. Movshon, W.T. Newsome, M.N. Shadlen and E. Zohary for making their data available in the ‘Neural Signal Archive’ (http://www.neuralsignal.org/), and W. Bair for maintaining this database. We thank B. Cumming, H. Sprekeler and N. Rubin for fruitful discussions, and Z. Mainen, W.T. Newsome, H. Sprekeler and E. Shea-Brown for comments on the manuscript. We wish to acknowledge W.T. Newsome and R. Kiani for help with the acquisition of the psychophysical data. This work was funded by the Spanish Ministry of Economy and Competitiveness together with the European Regional Development Fund (grants BFU2009-09537 and BFU2012-34838 to A.C., SAF2010-15730, SAF2013-46717-R and RYC-2009-04829 to J.R., BFU2012-33413 and RYC-2011-08755 to A.Ro.), the German Research Foundation (fellowship Wi 3767/1-1 to K.W.), the Champalimaud Foundation (A.R. and D.P.), the EU (Marie Curie grants PCIG11-GA-2012-322339 to A.R., PIRG07-GA-2010-268382 to J.R. and BIOTRACK contract PCOFUND-GA-2008-229673 to A.Ro.), the Fundação para a Ciência e a Tecnologia (grant SFRH/BD/51258/2010 to D.P.), and the Howard Hughes Medical Institute (D.P.). Part of the work was carried out at the Esther Koplowitz Centre, Barcelona.

Author information

Authors and Affiliations

Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), C/Rosselló 149, Barcelona, 08036, Spain
Klaus Wimmer, Albert Compte, Alex Roxin & Jaime de la Rocha
Centre de Recerca Matemàtica (CRM), Campus de Bellaterra, Edifici C, Barcelona, 08193, Spain
Alex Roxin
Champalimaud Neuroscience Programme, Champalimaud Centre for the Unknown, Lisbon, 1400-038, Portugal
Diogo Peixoto & Alfonso Renart
Department of Neurobiology, Stanford University, 299 Campus Drive West, Stanford, 94305-5125, California, USA
Diogo Peixoto

Authors

Klaus Wimmer
View author publications
You can also search for this author in PubMed Google Scholar
Albert Compte
View author publications
You can also search for this author in PubMed Google Scholar
Alex Roxin
View author publications
You can also search for this author in PubMed Google Scholar
Diogo Peixoto
View author publications
You can also search for this author in PubMed Google Scholar
Alfonso Renart
View author publications
You can also search for this author in PubMed Google Scholar
Jaime de la Rocha
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.R., A.C. and K.W. conceived the study and developed the computational model. All the authors contributed to the design of the study and to the interpretation of the data. D.P. collected the psychophysical data; K.W. performed the simulations and analysed the simulation and psychophysical data; K.W. and J.R. analysed the physiological data; K.W. and J.R. wrote the paper, with contributions from the rest of the authors.

Corresponding author

Correspondence to Jaime de la Rocha.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary Figures 1-10, Supplementary Tables 1-4, Supplementary Methods and Supplementary References (PDF 805 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Wimmer, K., Compte, A., Roxin, A. et al. Sensory integration dynamics in a hierarchical network explains choice probabilities in cortical area MT. Nat Commun 6, 6177 (2015). https://doi.org/10.1038/ncomms7177

Download citation

Received: 27 November 2014
Accepted: 29 December 2014
Published: 04 February 2015
DOI: https://doi.org/10.1038/ncomms7177

This article is cited by

Priority coding in the visual system
- Nicole C. Rust
- Marlene R. Cohen
Nature Reviews Neuroscience (2022)
Flexible categorization in perceptual decision making
- Genís Prat-Ortega
- Klaus Wimmer
- Jaime de la Rocha
Nature Communications (2021)
Adaptive circuit dynamics across human cortex during evidence accumulation in changing environments
- Peter R. Murphy
- Niklas Wilming
- Tobias H. Donner
Nature Neuroscience (2021)
Decision-related feedback in visual cortex lacks spatial selectivity
- Katrina R. Quinn
- Lenka Seillier
- Hendrikje Nienborg
Nature Communications (2021)
Proactive and reactive accumulation-to-bound processes compete during perceptual decisions
- Lluís Hernández-Navarro
- Ainhoa Hermoso-Mendizabal
- Alexandre Hyafil
Nature Communications (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.