Choice selective inhibition drives stability and competition in decision circuits

Roach, James P.; Churchland, Anne K.; Engel, Tatiana A.

doi:10.1038/s41467-023-35822-8

Download PDF

Article
Open access
Published: 10 January 2023

Choice selective inhibition drives stability and competition in decision circuits

Nature Communications volume 14, Article number: 147 (2023) Cite this article

4435 Accesses
4 Citations
15 Altmetric
Metrics details

Subjects

Abstract

During perceptual decision-making, the firing rates of cortical neurons reflect upcoming choices. Recent work showed that excitatory and inhibitory neurons are equally selective for choice. However, the functional consequences of inhibitory choice selectivity in decision-making circuits are unknown. We developed a circuit model of decision-making which accounts for the specificity of inputs to and outputs from inhibitory neurons. We found that selective inhibition expands the space of circuits supporting decision-making, allowing for weaker or stronger recurrent excitation when connected in a competitive or feedback motif. The specificity of inhibitory outputs sets the trade-off between speed and accuracy of decisions by either stabilizing or destabilizing the saddle-point dynamics underlying decisions in the circuit. Recurrent neural networks trained to make decisions display the same dependence on inhibitory specificity and the strength of recurrent excitation. Our results reveal two concurrent roles for selective inhibition in decision-making circuits: stabilizing strongly connected excitatory populations and maximizing competition between oppositely selective populations.

Computational complexity drives sustained deliberation

Article Open access 24 April 2023

Initial conditions combine with sensory evidence to induce decision-related dynamics in premotor cortex

Article Open access 16 October 2023

Adaptive circuit dynamics across human cortex during evidence accumulation in changing environments

Article 26 April 2021

Introduction

Perceptual decision-making requires neural circuits to integrate evidence and classify a stimulus to trigger the correct behavioral response. Neurons in a range of cortical areas modulate their firing rate to signal animal’s choice¹. The functional properties of decision-making neural circuits have been extensively studied and modeled^{2,3,4,5,6,7,8,9}. Central to the function of these circuit models are attractors in the activity space which characterize the population’s encoding of a given choice. The attractor mechanism driving the decision-making activity in these models relies on structured recurrent connections between populations of excitatory neurons that are each selective for a different choice^8,10,11. Inhibitory neurons, in this view, are merely supporting actors facilitating competition and providing balance to the excitatory neurons.

Since the canonical models of decision-making circuits were built, the diversity and complexity of inhibitory neurons within the cortex have been characterized in increasing detail¹². In primary sensory areas, inhibitory neurons are generally more broadly tuned¹³ and more densely connected to neighboring excitatory neurons^14,15. These inhibitory neurons reliably modulate spike output to reflect stimulus features and have highly specific connectivity to surrounding excitatory neurons^16,17. The stimulus selectivity of inhibitory neurons is enhanced by learning and attention¹⁸ suggesting that task dependent modulation of inhibitory activity is necessary for cognition. Beyond the primary sensory cortex, stimulus information and animal choice can be decoded from the activity of inhibitory neurons in secondary sensory and association areas indicating a role for selective inhibition in higher cognitive functions, such as decision making^19,20,21. While there is growing evidence that the activity and connectivity of inhibitory neurons is as complex as excitatory neurons, how the selectivity of inhibitory activity and the diversity of their connections affect the decision-making function of cortical circuits is still unknown.

To reveal the role of choice selective inhibitory neurons in decision-making computations we extended a well established mean-field model of decision-making circuits⁴ to account for the presence of inhibitory choice selectivity. Our model allows us to parametrically alter the specificity of connections between two choice-selective excitatory and two choice-selective inhibitory populations. Through analysis of this model, we found that while inhibition must drive competition between choice-selective excitatory populations it must also stabilize activity driven by recurrent excitation at the same time. These two concurrent roles are mediated by inhibitory connections to the excitatory populations and either role can be enhanced by structured inhibitory connectivity. We found that inhibitory selectivity expands the space of possible circuits which support decision-making by enhancing either a competitive or stabilizing role for inhibition. In addition, the connectivity motif between choice selective populations alters the underlying attractor dynamics and modulates the decision-making performance to prioritize speed or accuracy. We generalized these results by training recurrent neural networks (RNNs) to perform the same decision-making task. After training, RNNs had both excitatory and inhibitory units significantly selective for choice and displayed a similar dependence between the specificity of excitatory and inhibitory connections found in the mean-field model. Finally, we perturbed inhibitory neuron activity in these models to probe the dynamical regime in which the circuit operates. We found two regimes in which circuits respond differently to perturbations of inhibitory neurons: one in which the competitive role dominates and the other in which the stabilizing role dominates. Our work demonstrates that choice selective inhibition impacts decision-making behavior by enhancing either the competitive or the stabilizing role for inhibition in the circuit. These results generate testable predictions for perturbation experiments.

Results

We consider circuits where two excitatory (E) populations integrate dedicated streams of sensory evidence to produce a categorical choice (Fig. 1a). In contrast to previous circuit models of decision-making with global inhibition, we include two inhibitory (I) populations which can inherit choice selectivity from excitatory neurons (Methods). We model the circuit dynamics using two-dimensional mean-field equations where the mean presynaptic activation of N-methyl-D-aspartate (NMDA) receptor of the two excitatory (E₁ and E₂) populations are the dynamic variables⁴. The average strength of connections between the four choice selective populations is controlled by a specificity parameter γ. For each of three connection classes (E to E, E to I, and I to E; Fig. 1b), γ_EE, γ_EI, and γ_IE set the balance of connection strengths between populations with the same and opposite choice selectivity (Fig. 1c). For example, (1 + γ_EE) is the strength of feedback connections within excitatory populations selective for the same choice, and (1 − γ_EE) is the strength of connections between excitatory populations selective for the opposite choice. We keep γ_EE positive due to the importance of recurrent feedback excitation in the function of these circuits⁴. When γ_EE = 1, each of E₁ and E₂ have a strong self-excitatatory feedback and are not connected to each other. When γ_EE = 0, the strengths of excitatory connections between and within E₁ and E₂ are all equal. Inhibitory choice selectivity is controlled by γ_EI defined in the same way, which is also positive because inhibitory neurons inherit choice and stimulus information from the excitatory neurons. When γ_EI = 1, inhibitory population I₁ receives excitatory inputs from E₁ but not E₂ and vice versa. When γ_EI = 0, each I₁ and I₂ receive equal excitatory inputs from E₁ and E₂. Thus, inhibitory activity is not choice selective when γ_EI = 0 because inhibitory neurons receive equal input from both excitatory populations. Inhibitory choice selectivity emerges as γ_EI increases (Fig. 1d).

**Fig. 1: A mean-field circuit model of decision making with choice-selective inhibition.**

For inhibitory choice selectivity to have any effect on circuit function, the outputs of inhibitory populations must be structured (i.e. γ_IE ≠ 0; Fig. 1c). The specificity of inhibitory outputs γ_IE can range between [−1, 1] with negative values favoring connections between E and I populations with opposite choice preference and positive values favoring connections between E and I populations with the same choice preference. When γ_IE = 1, I₁ sends inhibitory output to E₁ but not E₂. When γ_IE = −1, I₁ sends inhibitory output to E₂ but not E₁. Thus, the specificity of inhibitory output connectivity defines three circuit motifs: contraspecific for γ_IE < 0, ipsispecific for γ_IE > 0, and nonspecific for γ_IE = 0.

In any decision-making circuit, inhibition concurrently fulfills two roles. The first is providing the substrate for competition between the excitatory populations, and the second is stabilizing the self-amplification driven by strongly recurrent excitatory populations. Both of these roles must be fulfilled for a circuit to function, but specific connections to and from inhibitory populations could enhance one of these roles (Fig. 1c). Specifically, ipsispecific inhibition can promote stabilizing feedback and contraspecific inhibition can maximize competition.

In response to an input stimulus, the circuit can produce different choice outcomes by changing the firing rates of the excitatory populations. Circuits report a choice by persistently raising the firing rate of one excitatory population at least 15 Hz above the other. Trials where this separation does not occur are considered invalid and not included in the calculation of psychometric or chronometric functions (Fig. 1e, Methods). We also require that prior to the stimulus onset, the circuit maintains low, symmetric activation of excitatory neurons. Persistence of the decision after stimulus offset allows for a choice readout to be made even after a significant delay and its utility led us to include the working memory of a choice in our criteria for inclusion as a circuit supporting decision-making (Fig. 1e). These dynamics are governed by eight fixed points across the phase planes of unstimulated and stimulated system, which are essential for the functional decision-making and working memory behavior (Fig. 1e, f). Prior to stimulus onset, both excitatory populations maintain low symmetric activation, which is set by an attractor located near the origin in the unstimulated phase plane. Following stimulus onset, the firing rate for both populations increases as the system approaches a saddle point along the stable manifold which acts as a separatrix between two choice attractors in the stimulated phase plane. Following stimulus offset, the system returns to its unstimulated phase plane and the choice of the circuit is preserved by one of two working memory attractors.

Inhibitory connection specificity expands the space of circuits that support decision making

Using the mean-field model, we investigated how the circuit’s ability to perform decision-making depends on the inhibitory connectivity structure. Specifically, we determined how choice-selective inhibition affects the presence of the eight fixed points (three attractors and two saddle points in the unstimulated phase plane, and two attractors and one saddle in the stimulated phase plane) governing decision-making behavior. We sampled the specificity parameter space to identify circuits which support these eight fixed points (Fig. 2a). We found that a broad range of circuit configurations can support decision making. There are two components of inhibitory choice selectivity which rely on specific connections to and from inhibitory populations. The first is the degree of choice selective firing by inhibitory neurons that is controlled by γ_EI. The second is the degree to which inhibitory populations have a specific effect on excitatory neurons that is controlled by γ_IE. We combine these two components into a specificity index γ_EIγ_IE, which is negative for contraspecific and positive for ipsispecific circuits following the sign of γ_IE. The specificity of excitatory and inhibitory connections is highly correlated in circuits supporting decision making (Fig. 2b). When inhibition is nonselective (γ_EI = 0) or nonspecific (γ_IE = 0), the strength of recurrent excitation (γ_EE) is highly constrained and deviations from a narrow range leads to the loss of one of the essential fixed points (Fig. 2c). For circuits with selective inhibition, a wider range of γ_EE will support decision making as long as a complementary inhibitory motif is present. For low γ_EE, the inhibitory motif must be contraspecific (γ_EIγ_IE < 0, Fig. 2b and d left) and for high γ_EE it must be ipsispecific (γ_EIγ_IE > 0, Fig. 2b, d right). A contraspecific inhibitory motif can promote competition in circuits where excitatory feedback connections are insufficiently strong to amplify firing rate differences between choice selective populations. An ispispecific inhibitory motif can stabilize excitatory feedback to prevent inadvertent winner-take-all dynamics in the absence of stimulus in circuits with strong excitatory specificity. By enhancing either the competitive or stabilizing role, circuits with choice selective inhibitory populations can support decision making for a wider range of γ_EE (Fig. 2b).

**Fig. 2: Choice-selective inhibition expands the space of circuits supporting decision making.**

The emphasis on competition or stability can also be seen in which fixed points are lost when connection specificity between excitatory and inhibitory populations are not complementary. When γ_EE is low, nonspecific and ipsispecific circuits lack the fixed points representing choice both in the presence and absence of stimulation as well as the saddle point during the stimulus (Fig. 2d left, Supplementary Fig. 1), because recurrent excitation is too weak to drive competition alone. Contraspecific inhibition paired with low γ_EE restores these fixed points by emphasizing competition between populations selective for opposite choices. These fixed points emerge sequentially as the inhibitory motif becomes more contraspecific: first the choice attractors appear, followed by the saddle point, and finally by the working memory attractors (arrow in Fig. 2d left). For moderate γ_EE, nonspecific circuits have all eight necessary fixed points, but deviations to a contraspecific motif cause the loss of the attractor for the low initial state, whereas deviations to an ipsispecific motif cause the loss of the working memory attractors, then saddle point, and then choice attactors (arrow in Fig. 2d center, Supplementary Fig. 1). For circuits with high γ_EE to support decision making, inhibitory motif must be ipsispecific, as nonspecific and contraspecific circuits lack the initial low activation state attractor (Fig. 1d right, Supplementary Fig. 1). The trade-off between competition and stability across contraspecific and ipsispecific circuits is also evident in the size of choice-selective populations that support decision-making (Supplementary Fig. 2).

Specific connections between choice-selective inhibitory populations may also impact the attractors underlying decision-making. For example, competitive inhibitory-inhibitory connections can mediate disinhibition in contraspecific circuits^22,23. We therefore investigated the effect of inhibitory-to-inhibitory connection specificity on decision-making dynamics. We extended our mean-field approach to explicitly model the activity of two choice-selective excitatory and two choice-selective inhibitory populations (Methods). This four-variable model produces the firing-rate dynamics and attractors similar to the original two-variable mean-field model (Supplementary Fig. 3a–c). In the four-variable model, we controlled the balance of connection strength between choice-selective inhibitory populations in the same manner as for other connection classes using a specificity parameter γ_II, which like γ_IE ranges between [−1, 1]. We sampled the four-dimensional specificity parameter space to identify points where the eight decision-making attractors are present (Supplementary Fig. 3d). As with the two-variable model, the main factor determining whether a circuit has the necessary fixed points is the linear relationship between γ_EE and γ_EIγ_IE. Inhibitory-to-inhibitory connection specificity γ_II has a limited impact on the presence of the fixed points (Supplementary Fig. 3d, cf. Fig. 2b). Circuits with negative γ_II are hyper-competitive and lose the low activation state attractor.

Inhibitory motif controls the speed versus accuracy trade-off

The roles enhanced by contra- and ipsispecific inhibititory motifs lead to differences in performance of decision circuits. In circuits with moderate strengths of recurrent excitation, all three motifs can support decision making for the same γ_EE. We found that circuits with three inhibitory motifs differ in choice accuracy on difficult trials where stimulus strength is weak (Fig. 3a). Relative to a circuit with nonspecific inhibitory outputs (γ_IE = 0), ipsipecific circuits are more accurate at classifying difficult stimuli but more often fail to separate the outputs sufficiently producing invalid trials (Fig. 3b). Contraspecific circuits, on the other hand, have lower accuracy for difficult stimuli. In addition, contraspecific circuits have a stimulus independent rate of trial failure attributable to trials where the firing rates of choice-selective populations separate prior to the stimulus onset (Fig. 3b), highlighting how these circuits are primed for competitive dynamics. It is well known that decision accuracy and decision time are linked through the speed-accuracy trade-off, where longer integration times lead to more accurate decisions^24,25,26. Ipsispecific circuits could be more accurate at the expense of speed, so we compared the average time it takes circuits to cross the decision threshold for each stimulus strength as a proxy for decision time. Ipsispecific circuits do indeed arrive at choices more slowly than the less accurate contraspecific circuits (Fig. 3c). These differences in behavioral performance indicate a speed versus accuracy trade-off which is mediated by the specificity of connections between choice-selective populations in the circuit (also evident in the four-variable model, Supplementary Fig. 3e). These performance outcomes again highlight the roles enhanced by ipsispecific and contraspecific inhibition: the contraspecific motif primes a circuit for competition, whereas the ipsispecific motif promotes stability, lengthening integration times.

**Fig. 3: Inhibitory circuit motifs mediate the speed-accuracy trade-off in decision-making.**

We can understand the speed-accuracy trade-off between ipsi- and contraspecific circuits by analyzing the dynamics around the saddle point. Differences in these dynamics are seen by comparing single-trial trajectories of ipsi-, non-, and contrapecific circuits in response to the neutral stimulus (Fig. 3d). At the trial start, both choice-selective populations are symmetrically activated and the trajectory moves along the stable manifold toward the saddle point. The circuit activity deviates to a choice attractor after approaching the saddle. Contra- and ipsispecific circuits differ in both how far along the stable manifold the activity progresses and how quickly it moves toward the choice attractor once it deviates. We can estimate how quickly the dynamics will leave the neighborhood of the saddle point with the time-constant τ_slow, which is the time-constant of dynamics moving along the unstable manifold of the saddle point^4,27. Changing the circuit motif from contraspecific to ipsispecific by increasing γ_EIγ_IE leads to an increase in τ_slow (Fig. 3e) and slowing down the pace of decisions (Fig. 3f). The divergence of τ_slow indicates that ipsispecific inhibition stabilizes the saddle point until at high γ_EIγ_IE a bifurcation occurs and the saddle point becomes an attractor with a symmetric high activity state (Fig. 3g). This bifurcation leads to the system stabilizing in a state where the firing rates of two choice-selective populations do not sufficiently separate on neutral and difficult stimuli trials, a state where the circuit fails to produce a decision. Easy stimuli impose a stronger asymmetry on the phase plane⁴ allowing circuits with highly ipsispecific inhibition to converge to a choice on easy trials (Supplementary Fig. 4).

Strong ipsispecific inhibition destabilizes working memory

The inhibitory connectivity motif affects the circuit’s ability to maintain the working memory of a choice. Contraspecific and nonspecific circuits maintain a difference in excitatory firing rates of at least 15 Hz for a very long time following stimulus offset, whereas ipsispecific circuits exhibit a degradation of the choice readout (Fig. 4a). This behavior can be linked to the phase plane of the unstimulated circuit. Working memory is supported by two choice attractors that are separated by saddle points from the attractor with symmetric low activity state. The separation between the working memory attractors and the saddle points is smaller for more ipsispecific circuits (Fig. 4b). For highly ipsispecific circuits, working memory attractors are extinguished after merging with the saddle points (Fig. 4b).

Inhibitory choice selectivity in trained recurrent neural networks

So far, we used the mean-field approach to establish that choice-selective inhibition supports the function of decision-making circuits by enhancing a competitive or stabilizing role. Next, we wanted to test whether this result holds broadly by using another class of decision-making network models. We therefore trained excitatory-inhibitory recurrent neural networks (RNNs) to perform a decision-making task²⁸ and then tested whether inhibitory choice-selectivity regularly emerges in these networks after training and whether the dependence between the excitatory and inhibitory specificity aligns with the two roles for inhibition. We used RNNs with 100 excitatory and 25 inhibitory units (Fig. 5a), but our results are not specific to this number of units and hold in RNNs with twice the size (Supplementary Fig. 5). Two input streams projected to all excitatory units through input weights. Two output variables were calculated as a weighted sum of excitatory unit activity. We trained RNNs to perform an identical decision-making task as the mean-field circuits by raising an output variable which corresponds to the input stream with a higher mean value. Networks were trained by back-propagation through time to minimize the mean squared error between the network outputs and predefined targets. For a given trial, a choice was recorded when the output variables became separated by a fixed threshold set to 0.25. Trials were considered invalid if the outputs separated prior to the stimulus, failed to maintain separation after stimulus offset, or separation was never achieved. We trained networks until the correct choice was made on 85% of all trials (including correct, error, and invalid trials) in a 200 trial epoch. One hundred and fifty networks reached this training threshold in 104, 343 ± 9, 264 (mean ± s.d.) trials, ranging from 83,200 to 127,600 (Fig. 5b). Networks performed the task well, making errors and failing to complete trials only for difficult stimuli (Fig. 5c). Trained networks also took longer to make decisions when presented with a difficult stimulus, similarly to mean-field circuits (Supplementary Fig. 6).

We determined whether inhibitory neurons in these RNNs were choice selective. We classified recurrent units as choice selective using receiver operator characteristic (ROC) analysis²¹ (Methods). We constructed ROC curves by decoding network choice from a unit’s activity on the time-step following stimulus offset. To identify which units significantly modulated their firing rate to reflect choice, we compared the area under the ROC curve (AUC_ROC) to a shuffle distribution generated from randomized trial labels (two-sided permutation test, p < 0.05, 150 permutations). Units that were identified as choice selective increased activation following the onset of a stimulus corresponding to their preferred choice (Fig. 5d). Inhibitory units had overall higher choice selectivity than excitatory units, as measured by the selectivity index ∣AUC_ROC − 0.5∣ that can range from 0 to 0.5 (Fig. 5e, inhibitory 0.23 ± 0.17, excitatory 0.12 ± 0.16; mean ± s.d.; Wilcoxon rank-sum test p < 10⁻¹⁰). Also, the proportion of significantly selective units was higher for inhibitory than excitatory units (Fig. 5f, inhibitory 0.87 ± 0.07, excitatory 0.72 ± 0.06; mean ± s.d.; Wilcoxon Rank-Sum test p < 10⁻¹⁰). Thus, inhibitory unit activity contained overall more choice information than excitatory unit activity despite the fact that only excitatory units received stimulus input. In this respect RNNs differ from experimental data in which excitatory and inhibitory neurons contained similar choice information²¹.

Excitatory specificity aligns with ispi- and contraspecific inhibitory motifs in RNNs

Based on our mean-field model, we know that for choice-selective inhibition to impact circuit function, the connections from inhibitory to excitatory populations must be specific. Therefore, after identifying choice-selective units in RNNs, we sought to determine whether the connection specificity of excitatory-excitatory and excitatory-inhibitory pairs followed the relationship predicted by the mean-field model (Fig. 2b). To analyze the specificity of connections between choice-selective populations in the RNNs, we estimated the specificity parameter γ from the weights of trained RNNs defined in the same way as for the mean-field model (Methods). Trained networks consistently had strong excitatory-excitatory (γ_EE = 0.59 ± 0.07) and excitatory-inhibitory (γ_EI = 0.39 ± 0.06) specificity (Fig. 5g). This result is consistent with the constraint that inhibitory units inherit stimulus information from excitatory units to be choice or stimulus selective. Inhibitory-excitatory connections were nonspecific on average (γ_IE = 3.6 × 10⁻³ ± 0.03) but their distribution showed both ipsispecific and contraspecific motifs. Inhibitory-inhibitory connections were nonspecific on average with higher variation than inhibitory-excitatory connections (γ_II = −5.0 × 10⁻³ ± 0.06). Confirming the trend predicted by the mean-field model, excitatory specificity γ_EE was correlated with the inhibitory specificity index γ_EIγ_IE, where networks with stronger recurrent excitation were ipsispecific and networks with weaker recurrent excitation were contraspecific (Pearson’s r = 0.53, p < 10⁻¹⁰; Fig. 5h). When comparing the connection classes individually, we found positive correlations between excitatory-excitatory, excitatory-inhibitory, and inhibitory-excitatory specificity (Fig. 5i). Inhibitory-inhibitory connection specificity was not significantly correlated with any other connection class. The higher variance and negligible correlation with other connection classes suggest that the specificity of inhibitory-inhibitory connections was unconstrained in these networks, in line with the mean-field model, where specificity of inhibitory-inhibitory connections also had a small effect on whether circuits could perform decisions (Supplementary Fig. 3d). These results show that RNNs utilize choice selective inhibition to compensate for variation in excitatory-excitatory specificity.

To further test the relationship between the excitatory and inhibitory specificity, we trained additional sets of RNNs with higher or lower excitability of excitatory units. In the mean-field model, lower (higher) excitatory gain can be compensated by either an increase (decrease) in excitatory connection specificity or by strengthening of the contraspecific (ipsispecific) motif. Accordingly, we expect that changing the activation function slope of the excitatory units in RNNs should either shift the excitatory-excitatory specificity against the direction of the gain change or shift the inhibitory specificity towards contraselective (for lower slope) or ipsielective motif (for higher slope). We trained two additional sets of networks with hypoexcitable (slope 0.5) or hyperexcitable (slope 1.5) excitatory units. Changing the excitability of excitatory units led to large shifts in γ_EE without changing the distribution of inhibitory specificity (Supplementary Fig. 7). In these networks, γ_EE and γ_EIγ_IE were still correlated, with higher γ_EE leading to higher γ_EIγ_IE (Supplementary Fig. 8). These results indicate that excitatory-excitatory specificity is a higher leverage parameter that RNNs use as the most effective path to compensate for changes in the excitability of excitatory units. This observation is consistent with the effect of changes in γ_EE on the dynamics in the mean-field model. For accuracy, decision-time and τ_slow, changes in γ_EE are far more effective than changes in inhibitory specificity (Supplementary Fig. 9) when all other parameters are held constant. In both the mean-field and RNN models, excitatory-excitatory specificity has a larger effect than inhibitory specificity and is the main lever circuits use to compensate for changes in neural parameters.

Perturbing inhibitory neuron activity reveals regimes where stabilizing and competitive inhibition dominate

Using the mean-field and RNN models, we established how contra- and ipsispecific inhibitory motifs enhance two different roles for inhibition in decision making circuits. To further probe these roles, we next considered how circuits respond to perturbations of inhibitory neuron activity. We used perturbations that equally targeted all inhibitory neurons irrespective of their choice selectivity by driving them with a nonspecific input Δν_0,I (Fig. 6a). Such perturbations could be realized in optogenetic experiments. In circuits where the competitive role of inhibition dominates, we expect that enhancing inhibitory activity should speed up dynamics whereas suppressing inhibition should slow them down (Fig. 6b). Vice versa, in circuits where the stabilizing role of inhibition dominates, we expect that enhancing inhibitory activity should slow dynamics down and suppressing inhibition should speed them up (Fig. 6b). Because τ_slow provides a readily available estimate of the pace of dynamics in the mean-field model, we calculated τ_slow for varying nonspecific baseline input to inhibitory neurons ν_0,I. We found that depending on the baseline level of inhibitory activity both regimes are possible in the mean-field circuit: one where competitive role dominates and one where stabilizing role dominates (Fig. 6c). Around a low baseline value of inhibitory activity (ν_0,I = 11.5 in Fig. 6c), contra-, ipsi-, and nonspecific circuits respond to perturbations similarly, such that enhancing inhibition (Δν_0,I > 0) leads to a decrease in τ_slow, i.e. faster dynamics. Around a high baseline value of inhibitory activity (ν_0,I = 14 in Fig. 6c), all circuits respond in the opposite way, such that enhancing inhibition increases τ_slow. This U-shaped dependence of τ_slow on the baseline input to inhibitory neurons ν_0,I results from the system approaching bifurcation points at either extreme of the parameter range that supports decision making⁴ (Supplementary Fig. 10). These two regimes–a low inhibition and a high inhibition regime–differ in which role of inhibition dominates: competitive or stabilizing, respectively. The inhibitory motif (contra-, non-, or ipsispecific) further shifts this emphasis within the constraints of each regime. These regimes can be identified via perturbations by characterizing how the circuit dynamics respond to changes in inhibitory tone.

**Fig. 6: Perturbations to inhibitory activity reveal regimes where stabilizing and competitive inhibition dominate.**

To confirm the existence of competitive and stabilizing regimes, we perturbed the mean-field circuits around the low and high baseline values of the inhibitory activity. We enhanced or suppressed inhibition during the stimulus period of a trial and measured changes in the circuit performance. We constructed a set of metrics to quantify changes in the fraction of completed trials, decision time, and choice accuracy relative to the unperturbed circuit for all stimulus strengths. The effects of these perturbations followed the predictions from the calculation of τ_slow (Fig. 6d–k). Enhancing inhibition decreased decision time in the low inhibition regime, but increased decision time in the high inhibition regime (cf. Fig. 6d, f and h, j). Consistent with the slowing effects of the perturbation, circuits in the high inhibition regime failed more often to complete trials (Fig. 6e) and became more accurate (Fig. 6g) when inhibition was enhanced. Circuits in the low inhibition regime showed the opposite behavior (Fig. 6h–k). Thus, by perturbing inhibitory neuron activity we can determine whether the competitive or stabilizing inhibition dominates in a circuit.

We then delivered enhancing or suppressing perturbations to inhibitory units in trained RNNs during the stimulus period to identify in which inhibitory regime these networks operate. Enhancing inhibition increased decision times, reduced the fraction of completed trials, and increased accuracy, consistent with these RNNs operating in the stabilizing inhibition regime (cf. Fig. 6h–k and l–o).

Discussion

We showed that choice selectivity of inhibitory neurons can affect the function of decision making circuits by enhancing one of two roles for inhibition: facilitating competition or stabilizing recurrent excitation. In the mean-field model, choice selective inhibition and specific connections from inhibitory to excitatory populations expand the excitatory-excitatory specificity parameter space of circuits that support decision-making. For the range of excitatory connection specificities supporting both ipsispecific and contraspecific inhibitory circuits, the speed and accuracy of decisions tightly depend on whether the ipsi- or contraspecific inhibitory motif is present. Inhibitory choice selectivity also emerges in RNNs trained to perform a decision-making task, and the specificity of excitatory and inhibitory connections within trained RNNs is correlated, consistent with the mean-field model predictions. The mean-field model further predicts the existence of two dynamical regimes: (i) a low-inhibition regime where the competitive role dominates, and (ii) a high-inhibition regime where stabilizing role dominates. In trained RNNs, perturbations of all inhibitory neurons indicate that these networks operate in the stabilizing inhibition regime.

Decision-making circuits with non-selective inhibition exist only within a narrow range of excitatory-excitatory connection specificity. When inhibitory neurons inherit choice-selectivity from excitatory neurons and also project to excitatory neurons via specific connections, a broad range of circuit configurations can support decision-making. In circuits capable of decision-making, the correlation between the specificity of excitatory (γ_EE) and inhibitory connections (γ_EIγ_IE) reveals how the contra- and ipsispecific motifs enhance one of two roles for inhibition: facilitate competition between populations coding for opposite choices or stabilize amplification driven by strongly recurrent excitation. When γ_EE is low and excitatory populations alone cannot drive selective activation, contraspecific inhibitory motifs support decision-making by maximizing competition. Conversely, when γ_EE is high and excitatory self-amplification becomes unstable, ipsispecific inhibitory motifs stabilize firing rates.

The categorical output of decision-making circuits is thought to be driven by strongly selective excitatory to excitatory selectivity with the evidence accumulation based on amplification through NMDA receptors^2,4. In these models the specificity of excitatory connections is sufficient to drive competition and selective activation. We found that deviations from a narrow range of γ_EE require complementary inhibitory circuitry. When recurrent excitatory specificity is low, contraspecific inhibition is required to form the attractors needed for decision-making computation. This mechanism was described in circuits where excitatory populations have limited capacity for amplification, such as the midbrain circuit in the owl²², and in linear integrator models²⁹. On the other hand, when recurrent excitatory specificity is high, the strong excitatory feedback amplification needs matching ipsispecific inhibition to stabilize the circuit. This mode of inhibitory selectivity is known to improve stability and robustness of a circuit to perturbations^17,30. Additionally, shifts in E/I balance through modulation of gain or synaptic efficacy can improve the robustness and parameter range of decicion-making circuit models^31,32.

We found a similar relationship between excitatory and inhibitory connection specificity in RNNs suggesting the balance between competitive and stabilizing inhibition is a general principle in E-I networks. While specific connections between excitatory and inhibitory units were clearly important for the decision-making function in our networks, connections between inhibitory units appeared unconstrained, indicating this connection class has limited effect on circuit function like in the mean-field model. RNNs are increasingly often used to develop theories of how neural circuits perform computations^23,28,33. Some studies trained RNNs under the constraint that units have either exclusively excitatory or exclusively inhibitory outputs^28,34 (Dale’s law). Studies of E-I RNNs which focus on the impact of inhibitory connections show that specificity of inhibitory-inhibitory connections can be critical to circuit function²³. The apparent difference in the importance of inhibitory-inhibitory selectivity between our networks and previous work could result from differences in the training procedures³⁵. We observed a large impact of RNN training hyperparameters on the emerging circuit structure. Future work is needed to understand how details of training influence the emerging circuit structure and computations performed by RNNs.

Our results show that selective inhibition can have a marked effect on the function of neural circuits. Many models of categorical decision-making rely on a nonspecific pool of inhibitory neurons to enforce winner-take-all competition between excitatory neurons^2,3. While these models reproduce the dynamics of decision-making circuits they do not fully account for the diversity of interneurons within the cortex. Cortical inhibitory neurons show selective activation in many modalities including primary sensory^13,17,36,37 and association areas^19,20,21. Moreover, choice-selectivity of parietal inhibitory neurons is equal to that of excitatory neurons during an audio-visual discrimination task²¹.

In the mean-field model, we assume that choice selectivity of inhibitory neurons arises from specific connections from choice-selective excitatory neurons (γ_EI in our model). While it is possible that choice selectivity could arise from external inputs to interneurons³⁸ or even from random connections between excitatory and inhibitory neurons³⁹, most circuit models assume stimulus information is exclusively provided by inputs to excitatory neurons. Inhibitory choice-selectivity also emerged in our RNNs trained to perform 2AFC task²⁸. In our RNNs, inhibitory units can only inherit stimulus or choice information through specific connections from excitatory populations, unlike in other trained RNNs²³. For both excitatory and inhibitory units in trained RNNs, we found that the fraction of selective units was higher than is commonly used in circuit models^2,4 and found in experiments²¹. This difference could be due to the simplicity of RNNs compared to in vivo circuits, and also a training process which aims to minimize total activity through regularization. In addition, decisions in the RNN are fully determined by the local circuit, whereas an animal’s behavioral output arises from a broadly distributed circuitry. Although higher choice selectivity for inhibitory units was robust to doubling the network size (Supplementary Fig. 5b, c), it could result from the need to leverage all of these units in a network much smaller than those in the brain.

The core computation of the model is the selective activation of a single excitatory population when the stimulus is presented and a mechanism to integrate stimulus information before diverting to a choice attractor. By enhancing stability, ipsispecific circuits lengthen the period when a circuit can maintain mutual activation of populations encoding competing choices, thus increasing the integration window which leads to more accurate stimulus classifications. Contraspecific circuits, primed for competition, minimize the integration period which increases error frequency.

In attractor networks, modulation of τ_slow for controlling the speed and accuracy of decisions is well known and can arise from other mechanisms than inhibitory output specificity. In the model with nonspecific inhibition, τ_slow increases with stimulus difficulty⁴ and can be also modulated via top-down excitation²⁷. Our finding that excitatory-inhibitory connectivity influences this well established mechanism highlights the importance of inhibitory circuitry to evidence accumulation. A key difference between controlling τ_slow via inhibitory motif versus top-down excitation is that the location of the saddle point is unaffected by γ_IE whereas increasing top-down excitation shifts the saddle towards the origin, effectively acting as a collapsing decision-bound²⁷. Top-down excitation can be adjusted rapidly from one trial to the next to match the decision’s speed and accuracy to the task demands. Could the inhibitory motif also be dynamically changed to meet changing task requirements? Modulation of the speed-accuracy trade-off through changes of the inhibitory motif may be mediated by activation or inactivation of inhibitory subpopulations connected in either a contraspecific or ipsispecific pattern (representing a shift in γ_IE for the circuit as a whole).

Selective neuromodulatory control of genetically identifiable inhibitory subtypes may provide for control of inhibitory motifs. Inhibitory subtypes have distinct connectivity patterns to neighboring excitatory neurons: fast-spiking cells have far more reciprocal connections to excitatory neurons than adapting interneurons¹⁶. A shift in output specificity could be mediated through top-down activation of inhibitory subnetworks or through neuromodulation of distinct inhibitory subtypes such as PV⁺, SOM⁺, or VIP⁺. Acetylcholine has layer-dependent effects on the responsiveness of both regular spiking and fast spiking neurons in the visual cortex, which could differentially activate distinct inhibitory motifs on behaviorally relevant timescales^40,41,42. Additionally, acetylcholine can reduce the release of inhibitory neurotransmitters in cortical neurons⁴³, thus directly affecting inhibitory connectivity.

Our mean-field framework reduces the dynamics of the full network with 6 excitatory and inhibitory populations to a two-variable system using several approximations, in particular, the steady-state assumption for GABA dynamics⁴. This assumption is based on the timescale separation between decay time constants of slow NMDA (~100 ms) and fast GABA (~5 ms) conductance. The slow NMDA dynamics dominate the time evolution of network activity, and one can assume that all other variables reach their steady-state nearly instantly⁴. Despite being fast, the dynamics of GABA synapses can also affect decision-making behavior⁴⁴. A study⁴⁴ considered a set of circuits with parameters chosen so that when the steady-state assumption is applied, all models reduce to the two-variable model with the exact same parameter set. Thus, all differences in dynamics of these circuits were driven by GABA dynamics. In these circuits, the GABA dynamics mediated a speed-accuracy trade-off and, moreover, this tradeoff was more efficient in circuits with selective inhibition⁴⁴. While this study considered only ipsispecific inhibitory connectivity and a narrow space of circuits that all map onto a single parameter set of a two-variable model, our work explores a wide range of circuit configurations ranging from contraspecific to ipsispecific inhibitory motifs. Our findings are robust to the steady-state approximation of GABA dynamics as we show using a four-variable mean-field model (Supplementary Fig. 3). Together these results show that inhibitory connectivity motifs and GABA dynamics both affect decision-making behavior.

Another key performance metric that depends on selective inhibition is the rate of trial completion. Our models (both the mean-field and RNNs) fail to reach the imposed decision threshold on a fraction of trials with low stimulus strength, which we call invalid trials. This behavior is common across spiking^2,32,45, mean-field^4,44 and RNN²⁸ models of decision-making. Our treatment of invalid trials is conservative, as we report invalid trials as a separate behavioral outcome different from correct or incorrect decision³², whereas most other studies assign a choice at random on trials when the network does not reach the decision threshold^2,4,28,44,45. The random assignment of choices on invalid trials can conceal differences in network dynamics, making distinct dynamical regimes indistinguishable in psychometric functions⁴⁵. We find that the completion rate of difficult trials is reduced in circuits where stability is emphasized due to increased integration time. Circuit models frequently differ from experimental subjects in the rate of trial completion, which was attributed to an urgency signal gating the evidence accumulation process which is absent in circuit models^46,47,48,49. One possible mechanism for an urgency signal in decision circuits could be a nonspecific external ramping input⁵⁰. Incorporating such inputs into future models of decision-making would be an important next step in the study of selective inhibition.

We show that choice selective inhibition can enhance one of two roles for inhibition in decision-making circuits: facilitating competition or stabilizing excitatory feedback. Both these roles are simultaneously fulfilled by inhibition in any decision making circuit. Enhancing activity of all inhibitory neurons can shift the circuit from a regime where the competitive role dominates to a regime where the stabilizing role dominates regardless of which inhibitory motif is present. This effect echos results which find shifts in E/I balance can induce leaky or unstable integration⁴⁵. The stabilizing and competitive regimes can be differentiated by the behavioral response to perturbations of inhibitory activity. Perturbations during reaction time tasks should reveal which inhibitory role is dominant in vivo. The balance of these two roles is critical for circuits to perform decision tasks, and shifts in this balance could align dynamics with changing task requirements. More experimental work is needed to uncover how inhibitory subnetworks strike this balance in the cortex. Specifically, whether functional selectivity is constrained to certain inhibitory subtypes and whether inhibitory neurons are recruited to perform a task in a state dependent manner are important questions for future work.

Methods

Mean-field model

Our mean-field model accounts for interactions among 6 populations: 3 excitatory (2 choice selective and 1 nonselective) and 3 inhibitory (2 choice selective and 1 nonselective). Including nonselective neurons in the model is consistent with previous work⁴ and reflects the experimental observation that only a fraction of all recorded neurons shows choice selectivity²¹. Each selective population contains the fraction f of the total number N_E (N_I) of excitatory (inhibitory) neurons, so that 1 − 2f is the proportion of nonselective neurons. We reduce the dynamics of the full network with 6 excitatory and inhibitory populations to a dynamical system with two variables representing the activations of N-methyl-D-aspartate (NMDA) conductances (in terms of fraction of channels open) for synapses originating from two choice-selective excitatory populations⁴. The model reduction to two dimensions leverages the timescale separation between decay time constants of the slow NMDA (~100 ms) and fast γ-aminobutyric acid (GABA, ~5 ms) and α-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid (AMPA, ~2 ms) receptors. The slow NMDA dynamics dominate the time evolution of the system, and one can assume that all other variables reach their steady-state nearly instantly⁴. The dynamics of the NMDA activation variable for population i (i ∈ {1, 2}) are governed by:

$$\frac{d{S}_{i}}{dt}=\frac{-{S}_{i}}{{\tau }_{{{{{{{{\rm{NMDA}}}}}}}}}}+(1-{S}_{i})\gamma {{\Phi }}({x}_{i}),$$

(1)

where τ_NMDA = 0.1 s and γ = 0.641. The non-linear function Φ transforms input current x_i [nA] into firing rate:

$${{\Phi }}({x}_{i})=\frac{a{x}_{i}-b}{1-{e}^{-d(a{x}_{i}-b)}},$$

(2)

where a = 270 nC⁻¹, b = 108 Hz, and d = 0.154 s. The input to population i is:

$${x}_{i}={\alpha }_{1}({\gamma }_{{{{{{{{\rm{EE}}}}}}}}},\,{\gamma }_{{{{{{{{\rm{EI}}}}}}}}},\,{\gamma }_{{{{{{{{\rm{IE}}}}}}}}}){S}_{i}+{\alpha }_{2}({\gamma }_{{{{{{{{\rm{EE}}}}}}}}},\,{\gamma }_{{{{{{{{\rm{EI}}}}}}}}},\,{\gamma }_{{{{{{{{\rm{IE}}}}}}}}}){S}_{j}+{I}_{0,\,i}({\gamma }_{{{{{{{{\rm{EE}}}}}}}}},\,{\gamma }_{{{{{{{{\rm{EI}}}}}}}}},\,{\gamma }_{{{{{{{{\rm{IE}}}}}}}}})+{I}_{{{{{{{{\rm{stim}}}}}}}},\,i}+{I}_{\eta,\,i},$$

(3)

where index j refers to the other excitatory population. The complexity of the circuit structure, including interactions between all selective and nonselective excitatory and inhibitory neurons, is collapsed into two-dimensional model through the variables α₁, α₂, and I_0,i as described in the section Circuit Structure below.

The stimulus ${I}_{{{{{{{{\rm{stim}}}}}}}},i}$ is defined as an increase in the rate of external excitatory inputs to choice-selective excitatory neurons of magnitude μ. We define the strength of evidence for one versus the other choice as stimulus coherence c, which can range between −100% and 100%. For population i the stimulus is then defined as:

$${I}_{{{{{{{{\rm{stim}}}}}}}},\,i}(t,\,\mu,\,c)=\left\{\begin{array}{ll}{J}_{{{{{{{{\rm{AMPA}}}}}}}},{{{{{{{\rm{ext}}}}}}}}}\mu (1-\frac{c}{100})&{t}_{{{{{{{{\rm{stim}}}}}}}},{{{{{{{\rm{on}}}}}}}}} \, < \,t \, < \,{t}_{{{{{{{{\rm{stim}}}}}}}},{{{{{{{\rm{off}}}}}}}}},\; i=1,\\ {J}_{{{{{{{{\rm{AMPA}}}}}}}},{{{{{{{\rm{ext}}}}}}}}}\mu (1+\frac{c}{100})&{t}_{{{{{{{{\rm{stim}}}}}}}},{{{{{{{\rm{on}}}}}}}}} \, < \,t \, < \,{t}_{{{{{{{{\rm{stim}}}}}}}},{{{{{{{\rm{off}}}}}}}}},\; i=2,\\ 0 \hfill &{{{{{{{\rm{otherwise}}}}}}}}.\hfill\end{array}\right.$$

(4)

For all cases, we set μ to 40 Hz. Noise is introduced through the inputs I_η,i to the two excitatory populations filtered through fast synaptic activation of AMPA receptors:

$$\frac{d{I}_{\eta,i}}{dt}=-\frac{{I}_{\eta,i}}{{\tau }_{{{{{{{{\rm{AMPA}}}}}}}}}}+\frac{\eta (t)}{\sqrt{{\tau }_{{{{{{{{\rm{AMPA}}}}}}}}}}},$$

(5)

where τ_AMPA is 0.002 s and η(t) is a white Gaussian noise with zero mean and standard deviation 0.02 nA. We performed numerical simulations using the Euler method with a 2 ms time step.

Circuit structure

We derived two-dimensional mean-field equations, which model the dynamics of the entire circuit through the effective interaction strengths α₁, α₂ between the two excitatory populations, and the background currents I_0,i. This reduced model is based on approximating the firing rates of all three inhibitory populations (two choice-selective and one nonselective) and of the nonselective excitatory population as linear functions of their inputs. Thus, the firing rates of these populations change linearly in response to changes in the firing rates of the two explicitly modeled excitatory populations E₁ and E₂⁴. We define α₁ as a term which describes how activity S₁₍₂₎ from the excitatory population E₁₍₂₎ filters through the circuit (i.e. via E₂₍₁₎, E₀, I₀, I₁, I₂, and feeding back onto itself) to impact its own firing rate. Similarly, α₂ describes how the activity S₁₍₂₎ filters through the circuit to impact the firing rate of the opposite excitatory population. I_0,i describes the net input from the population activity that does not depend on the activity of E₁ or E₂. Thus, this model accounts for interactions between all six populations with only two dynamical system equations Eq. (1).

We parametrized connection specificity between choice-selective populations by γ_JK between presynaptic population J and postsynaptic population K. The index J, K ∈ {E, I} defines neuron type as excitatory or inhibitory. We translate γ_JK to a synaptic weight under a constraint that the total input to each population remains constant for all values of γ_JK. To this end, we defined an intermediate weight ${\hat{w}}_{JK}={N}_{s}{w}_{J}/({N}_{{{{{{{{\rm{s}}}}}}}}}+{\gamma }_{JK}(2-{N}_{{{{{{{{\rm{s}}}}}}}}}))$, where N_s = 2 is the number of competing choice-selective populations and w_E = w_I = 1. We then set connection weights between populations with the same choice selectivity to ${w}_{JK}^{+}={\hat{w}}_{JK}+{\gamma }_{JK}{\hat{w}}_{JK}$ and between populations with opposite selectivity to ${w}_{JK}^{-}={\hat{w}}_{JK}-{\gamma }_{JK}{\hat{w}}_{JK}$. We can rewrite γ in terms of w⁺ and w⁻ as:

$$\gamma=\frac{{w}^{+}-{w}^{-}}{{w}^{+}+{w}^{-}}.$$

(6)

Connections to and from nonselective neurons were held at w_J = 1. This definition enforces that all neurons receive the same total input weight for any value of γ_JK. We set the specificity parameter γ_EE = 0.32 as in refs. ^2,4, except in Figs. 1 and 2. We set γ_EI = 0.25 except in Figs. 1 and 2.

The effective interaction strengths α₁ describes the recurrent feedback from an excitatory population’s activity onto itself fed through other populations in the circuit. This term consists of four components α₁ = λ₁(α_1a + α_1b + α_1c + α_1d):

$${\alpha }_{1a}=f{N}_{{{{{{{{\rm{E}}}}}}}}}{w}_{{{{{{{{\rm{EE}}}}}}}}}^{+}\,{J}_{{{{{{{{\rm{NMDA}}}}}}}},{{{{{{{\rm{eff}}}}}}}},{{{{{{{\rm{E}}}}}}}}},$$

(7)

$${\alpha }_{1b}=\frac{1}{\kappa {g}_{{{{{{{{\rm{I2}}}}}}}}}}({c}_{{{{{{{{\rm{I}}}}}}}}}\,f{N}_{{{{{{{{\rm{E}}}}}}}}}{w}_{{{{{{{{\rm{EI}}}}}}}}}^{+}\,{J}_{{{{{{{{\rm{NMDA}}}}}}}},{{{{{{{\rm{eff}}}}}}}},{{{{{{{\rm{I}}}}}}}}})(\,f{w}_{{{{{{{{\rm{IE}}}}}}}}}^{+}{N}_{{{{{{{{\rm{I}}}}}}}}}\,{J}_{{{{{{{{\rm{GABA}}}}}}}},{{{{{{{\rm{E}}}}}}}}}{\tau }_{{{{{{{{\rm{GABA}}}}}}}}}),$$

(8)

$${\alpha }_{1c}=\frac{1}{\kappa {g}_{{{{{{{{\rm{I2}}}}}}}}}}({c}_{{{{{{{{\rm{I}}}}}}}}}\,f{N}_{{{{{{{{\rm{E}}}}}}}}}{w}_{{{{{{{{\rm{EI}}}}}}}}}^{-}\,{J}_{{{{{{{{\rm{NMDA}}}}}}}},{{{{{{{\rm{eff}}}}}}}},{{{{{{{\rm{I}}}}}}}}})(\,f{w}_{{{{{{{{\rm{IE}}}}}}}}}^{-}{N}_{{{{{{{{\rm{I}}}}}}}}}\,{J}_{{{{{{{{\rm{GABA}}}}}}}},{{{{{{{\rm{E}}}}}}}}}{\tau }_{{{{{{{{\rm{GABA}}}}}}}}}),$$

(9)

$${\alpha }_{1d}=\frac{1}{\kappa {g}_{{{{{{{{\rm{I2}}}}}}}}}}({c}_{{{{{{{{\rm{I}}}}}}}}}\,f{N}_{{{{{{{{\rm{E}}}}}}}}}{w}_{{{{{{{{\rm{E}}}}}}}}}\,{J}_{{{{{{{{\rm{NMDA}}}}}}}},{{{{{{{\rm{eff}}}}}}}},{{{{{{{\rm{I}}}}}}}}})(\,f{w}_{{{{{{{{\rm{I}}}}}}}}}{N}_{{{{{{{{\rm{I}}}}}}}}}\,{J}_{{{{{{{{\rm{GABA}}}}}}}},{{{{{{{\rm{E}}}}}}}}}{\tau }_{{{{{{{{\rm{GABA}}}}}}}}}).$$

(10)

These components of α₁ account for the effect of an excitatory population’s activity on its own activity filtered via (a) direct self-coupling, (b) the activity of the inhibitory population with the same choice selectivity, (c) the activity of the inhibitory population with the opposite choice selectivity, and (d) the activity of nonselective inhibitory neurons. Similarly, α₂ describes the influence of one excitatory population’s activity onto the other fed through all other populations in the circuit and also consists of four components α₂ = λ₂(α_2a + α_2b + α_2c + α_2d):

$${\alpha }_{2a}=f{N}_{{{{{{{{\rm{E}}}}}}}}}{w}_{{{{{{{{\rm{EE}}}}}}}}}^{-}\,{J}_{{{{{{{{\rm{NMDA}}}}}}}},{{{{{{{\rm{eff}}}}}}}},{{{{{{{\rm{E}}}}}}}}},$$

(11)

$${\alpha }_{2b}=\frac{1}{\kappa {g}_{{{{{{{{\rm{I2}}}}}}}}}}({c}_{{{{{{{{\rm{I}}}}}}}}}\,f{N}_{{{{{{{{\rm{E}}}}}}}}}{w}_{{{{{{{{\rm{EI}}}}}}}}}^{-}\,{J}_{{{{{{{{\rm{NMDA}}}}}}}},{{{{{{{\rm{eff}}}}}}}},{{{{{{{\rm{I}}}}}}}}})(\,f{w}_{{{{{{{{\rm{IE}}}}}}}}}^{+}{N}_{{{{{{{{\rm{I}}}}}}}}}\,{J}_{{{{{{{{\rm{GABA}}}}}}}},{{{{{{{\rm{E}}}}}}}}}{\tau }_{{{{{{{{\rm{GABA}}}}}}}}}),$$

(12)

$${\alpha }_{2c}=\frac{1}{\kappa {g}_{{{{{{{{\rm{I2}}}}}}}}}}({c}_{{{{{{{{\rm{I}}}}}}}}}\,f{N}_{{{{{{{{\rm{E}}}}}}}}}{w}_{{{{{{{{\rm{EI}}}}}}}}}^{+}\,{J}_{{{{{{{{\rm{NMDA}}}}}}}},{{{{{{{\rm{eff}}}}}}}},{{{{{{{\rm{I}}}}}}}}})(\,f{w}_{{{{{{{{\rm{IE}}}}}}}}}^{-}{N}_{{{{{{{{\rm{I}}}}}}}}}\,{J}_{{{{{{{{\rm{GABA}}}}}}}},{{{{{{{\rm{E}}}}}}}}}{\tau }_{{{{{{{{\rm{GABA}}}}}}}}}),$$

(13)

$${\alpha }_{2d}=\frac{1}{\kappa {g}_{{{{{{{{\rm{I2}}}}}}}}}}({c}_{{{{{{{{\rm{I}}}}}}}}}\,f{N}_{{{{{{{{\rm{E}}}}}}}}}{w}_{{{{{{{{\rm{E}}}}}}}}}\,{J}_{{{{{{{{\rm{NMDA}}}}}}}},{{{{{{{\rm{eff}}}}}}}},{{{{{{{\rm{I}}}}}}}}})(\,f{w}_{{{{{{{{\rm{I}}}}}}}}}{N}_{{{{{{{{\rm{I}}}}}}}}}\,{J}_{{{{{{{{\rm{GABA}}}}}}}},{{{{{{{\rm{E}}}}}}}}}{\tau }_{{{{{{{{\rm{GABA}}}}}}}}}).$$

(14)

The components of α₂ account for the effect on an excitatory population’s activity from the oppositely selective excitatory population’s activity filtered via (a) direct coupling, (b) the activity of the inhibitory population with the same selectivity, (c) the activity of the inhibitory population with the opposite selectivity, and (d) the activity of nonselective inhibitory neurons. The effects of nonselective neurons and external background inputs are described by I_0,i = λ_I(I_0,ia + I_0,ib + I_0,ic + I_0,id):

$${I}_{0,ia}=(1-{N}_{{{{{{{{\rm{s}}}}}}}}}f)\,{N}_{{{{{{{{\rm{E}}}}}}}}}{w}_{{{{{{{{\rm{E}}}}}}}}}\,{J}_{{{{{{{{\rm{NMDA}}}}}}}},{{{{{{{\rm{eff}}}}}}}},{{{{{{{\rm{E}}}}}}}}}{\psi }_{{{{{{{{\rm{3,in}}}}}}}}},$$

(15)

$${I}_{0,ib}={I}_{{{{{{{{\rm{AMPA}}}}}}}},{{{{{{{\rm{ext}}}}}}}},i}-(1-{N}_{{{{{{{{\rm{s}}}}}}}}}\,f){w}_{{{{{{{{\rm{I}}}}}}}}}{N}_{{{{{{{{\rm{I}}}}}}}}}\,{J}_{{{{{{{{\rm{GABA}}}}}}}},{{{{{{{\rm{E}}}}}}}}}{\tau }_{{{{{{{{\rm{GABA}}}}}}}}}({\nu }_{0,{{{{{{{\rm{I}}}}}}}}}+({c}_{{{{{{{{\rm{I}}}}}}}}}{I}_{0,{{{{{{{\rm{I}}}}}}}}}-{I}_{{{{{{{{\rm{m,I}}}}}}}}})/{g}_{{{{{{{{\rm{I2}}}}}}}}})/\kappa,$$

(16)

$${I}_{0,ic}=-f{w}_{{{{{{{{\rm{IE}}}}}}}}}^{+}{N}_{{{{{{{{\rm{I}}}}}}}}}\,{J}_{{{{{{{{\rm{GABA}}}}}}}},{{{{{{{\rm{E}}}}}}}}}{\tau }_{{{{{{{{\rm{GABA}}}}}}}}}({\nu }_{0,{{{{{{{\rm{I}}}}}}}}}+({c}_{{{{{{{{\rm{I}}}}}}}}}{I}_{0,{{{{{{{\rm{I}}}}}}}}}-{I}_{{{{{{{{\rm{m,I}}}}}}}}})/{g}_{{{{{{{{\rm{I2}}}}}}}}})/\kappa,$$

(17)

$${I}_{0,id}=-f{w}_{{{{{{{{\rm{IE}}}}}}}}}^{-}{N}_{{{{{{{{\rm{I}}}}}}}}}\,{J}_{{{{{{{{\rm{GABA}}}}}}}},{{{{{{{\rm{E}}}}}}}}}{\tau }_{{{{{{{{\rm{GABA}}}}}}}}}({\nu }_{0,{{{{{{{\rm{I}}}}}}}}}+({c}_{{{{{{{{\rm{I}}}}}}}}}{I}_{0,{{{{{{{\rm{I}}}}}}}}}-{I}_{{{{{{{{\rm{m,I}}}}}}}}})/{g}_{{{{{{{{\rm{I2}}}}}}}}})/\kappa,$$

(18)

where:

$${I}_{{{{{{{{\rm{AMPA}}}}}}}},{{{{{{{\rm{ext}}}}}}}},i}={J}_{{{{{{{{\rm{AMPA}}}}}}}},{{{{{{{\rm{ext}}}}}}}},{{{{{{{\rm{E}}}}}}}}}{\tau }_{{{{{{{{\rm{AMPA}}}}}}}}}{N}_{{{{{{{{\rm{ext}}}}}}}}}{\nu }_{{{{{{{{\rm{ext}}}}}}}}},$$

(19)

$${I}_{0,{{{{{{{\rm{I}}}}}}}}}={I}_{{{{{{{{\rm{AMPA}}}}}}}},{{{{{{{\rm{ext}}}}}}}},{{{{{{{\rm{I}}}}}}}}}+{J}_{{{{{{{{\rm{NMDA}}}}}}}},{{{{{{{\rm{eff}}}}}}}},{{{{{{{\rm{I}}}}}}}}}{w}_{{{{{{{{\rm{E}}}}}}}}}(1-{N}_{{{{{{{{\rm{s}}}}}}}}}\,f){N}_{E}{\psi }_{{{{{{{{\rm{3,in}}}}}}}}},$$

(20)

$${I}_{{{{{{{{\rm{AMPA}}}}}}}},{{{{{{{\rm{ext}}}}}}}},{{{{{{{\rm{I}}}}}}}}}={J}_{{{{{{{{\rm{AMPA}}}}}}}},{{{{{{{\rm{ext}}}}}}}},{{{{{{{\rm{I}}}}}}}}}{\tau }_{{{{{{{{\rm{AMPA}}}}}}}}}{\nu }_{{{{{{{{\rm{ext}}}}}}}}},$$

(21)

$${\psi }_{{{{{{{{\rm{3,in}}}}}}}}}=\frac{\gamma {\tau }_{{{{{{{{\rm{NMDA}}}}}}}}}{\nu }_{{{{{{{{\rm{3,in}}}}}}}}}}{1+\gamma {\tau }_{{{{{{{{\rm{NMDA}}}}}}}}}{\nu }_{{{{{{{{\rm{3,in}}}}}}}}}}.$$

(22)

These terms account for the input to the excitatory population E_i from the nonselective excitatory population filtered via (a) direct coupling, (b) the nonselective inhibitory population, (c) the inhibitory population with the same choice selectivity, (d) the inhibitory population with the opposite selectivity. The term ψ accounts for the NMDA activation of nonselective excitatory neurons. We calculated the firing rate of inhibitory populations as Φ_I,1(2) = α_1,IS₁₍₂₎ + α_2,IS₂₍₁₎ + I_0,II, where:

$${\alpha }_{1,{{{{{{{\rm{I}}}}}}}}}=({c}_{{{{{{{{\rm{I}}}}}}}}}\,f{N}_{{{{{{{{\rm{E}}}}}}}}}{w}_{{{{{{{{\rm{EI}}}}}}}}}^{+}\,{J}_{{{{{{{{\rm{NMDAeff}}}}}}}},{{{{{{{\rm{I}}}}}}}}})/{g}_{{{{{{{{\rm{I2}}}}}}}}},$$

(23)

$${\alpha }_{2,{{{{{{{\rm{I}}}}}}}}}=({c}_{{{{{{{{\rm{I}}}}}}}}}\,f{N}_{{{{{{{{\rm{E}}}}}}}}}{w}_{{{{{{{{\rm{EI}}}}}}}}}^{-}\,{J}_{{{{{{{{\rm{NMDAeff}}}}}}}},{{{{{{{\rm{I}}}}}}}}})/{g}_{{{{{{{{\rm{I2}}}}}}}}},$$

(24)

$${I}_{{{{{{{{\rm{0,II}}}}}}}}}={\nu }_{0,{{{{{{{\rm{I}}}}}}}}}+({c}_{{{{{{{{\rm{I}}}}}}}}}{I}_{0,{{{{{{{\rm{I}}}}}}}}}-{I}_{{{{{{{{\rm{m,I}}}}}}}}})/{g}_{{{{{{{{\rm{I2}}}}}}}}}.$$

(25)

All parameter values are provided in Table 1.

Table 1 Mean-field model parameters

Full size table

Evaluation of circuit performance

We considered a trial to be valid if the following criteria were met: (i) the firing rate difference between the two choice selective excitatory populations was less than 5 Hz for the entire period prior to stimulus onset, (ii) the firing rate difference was above the decision threshold of 15 Hz for at least one time step during the stimulus period and the time point following stimulus offset. Fraction completed trials for each stimulus level was defined as the number of valid trials out of all trials presented. Only valid trials were considered for computing chronometric and psychometric functions. Our treatment of invalid trials is more conservative than in many other studies, as we report invalid trials as a separate behavioral outcome different from correct or incorrect decision³², whereas many other studies assign a choice at random on trials when the network does not reach the decision threshold^2,4,28,44,45. The random assignment of choices on invalid trials can conceal differences in network dynamics, making distinct dynamical regimes indistinguishable in psychometric functions⁴⁵.

Phase plane and bifurcation analysis

We analyzed the mean-field model to find null-clines and fixed points using MatLab’s fsolve function with the Levenberg-Marquant algorithm and a tolerance of 1 × 10⁻⁶. To identify the stability of the fixed points, we computed the Jacobian matrix analytically and found its eigenvalues numerically using the eig() function in MatLab. For the saddle points, τ_slow is the inverse of the positive eigenvalue of the Jacobian matrix.

Recurrent neural network models

Recurrent neural networks (RNNs) were composed of 100 excitatory and 25 inhibitory units. We obtained the same results with networks twice as large (Supplementary Fig. 5). The dynamics of these networks were governed by the equations:

$${{{{{{{{\bf{x}}}}}}}}}_{{{{{{{{\rm{E}}}}}}}}}(t)= (1-{\alpha }_{{{{{{{{\rm{r}}}}}}}}}){{{{{{{{\bf{x}}}}}}}}}_{{{{{{{{\rm{E}}}}}}}}}(t-1)+{\alpha }_{{{{{{{{\rm{r}}}}}}}}}({{{{{{{{\bf{W}}}}}}}}}^{{{{{{{{\rm{EE}}}}}}}}}{{{{{{{{\bf{r}}}}}}}}}_{{{{{{{{\rm{E}}}}}}}}}(t-1)-{{{{{{{{\bf{W}}}}}}}}}^{{{{{{{{\rm{IE}}}}}}}}}{{{{{{{{\bf{r}}}}}}}}}_{{{{{{{{\rm{I}}}}}}}}}(t-1) \\ + {{{{{{{{\bf{W}}}}}}}}}^{{{{{{{{\rm{in}}}}}}}}}{{{{{{{{\bf{x}}}}}}}}}_{{{{{{{{\rm{in}}}}}}}}}(t)+{{{{{{{{\boldsymbol{\sigma }}}}}}}}}_{{{{{{{{\rm{r}}}}}}}}}^{{{{{{{{\rm{E}}}}}}}}}(t)),$$

(26)

$${{{{{{{{\bf{x}}}}}}}}}_{{{{{{{{\rm{I}}}}}}}}}(t)=(1-{\alpha }_{{{{{{{{\rm{r}}}}}}}}}){{{{{{{{\bf{x}}}}}}}}}_{{{{{{{{\rm{I}}}}}}}}}(t-1)+{\alpha }_{{{{{{{{\rm{r}}}}}}}}}({{{{{{{{\bf{W}}}}}}}}}^{{{{{{{{\rm{EI}}}}}}}}}{{{{{{{{\bf{r}}}}}}}}}_{{{{{{{{\rm{E}}}}}}}}}(t-1)-{{{{{{{{\bf{W}}}}}}}}}^{{{{{{{{\rm{II}}}}}}}}}{{{{{{{{\bf{r}}}}}}}}}_{{{{{{{{\rm{I}}}}}}}}}(t-1)+{{{{{{{{\boldsymbol{\sigma }}}}}}}}}_{{{{{{{{\rm{r}}}}}}}}}^{{{{{{{{\rm{I}}}}}}}}}(t)),$$

(27)

$${{{{{{{{\bf{x}}}}}}}}}_{{{{{{{{\rm{in}}}}}}}}}(t)=(1-{\alpha }_{{{{{{{{\rm{in}}}}}}}}}){{{{{{{{\bf{x}}}}}}}}}_{{{{{{{{\rm{in}}}}}}}}}(t-1)+{\alpha }_{{{{{{{{\rm{in}}}}}}}}}{{{{{{{\bf{u}}}}}}}}(t),$$

(28)

$${{{{{{{{\bf{r}}}}}}}}}_{{{{{{{{\rm{E}}}}}}}}({{{{{{{\bf{I}}}}}}}})}(t)={s}_{{{{{{{{\rm{E}}}}}}}}({{{{{{{\rm{I}}}}}}}})}{[{{{{{{{{\bf{x}}}}}}}}}_{{{{{{{{\rm{E}}}}}}}}({{{{{{{\bf{I}}}}}}}})}]}_{+},$$

(29)

$${{{{{{{\bf{z}}}}}}}}(t)={{{{{{{{\bf{W}}}}}}}}}^{{{{{{{{\rm{out}}}}}}}}}{{{{{{{{\bf{r}}}}}}}}}_{{{{{{{{\rm{E}}}}}}}}}(t).$$

(30)

Here x_E and x_I are the vectors of activation variables for excitatory and inhibitory units, respectively. r_E and r_I are the corresponding activities after applying the rectified linear (RELU) nonlinearity s_E(I)[]₊, where s_E(I) sets the excitability of the excitatory or inhibitory units. x_in is the input activation and u(t) is the instantaneous input. The time constants of recurrent units and inputs are set by α_r and α_in. Weights within and between units are housed in the matricies W_EE, W_EI, W_IE, W_II. Only the excitatory units receive projections from the input and project to the output through Wⁱⁿ and W^out, respectively.

RNNs received two input streams u(t) = [u₁(t), u₂(t)] representing sensory evidence:

$${u}_{i}(t,c)=\left\{\begin{array}{lll}{u}_{0}+(1+\mu \frac{c}{100})+{\sigma }_{{{{{{{{\rm{in}}}}}}}},i}(t)&{t}_{{{{{{{{\rm{stim}}}}}}}},{{{{{{{\rm{on}}}}}}}}} \, < \,t \, < \,{t}_{{{{{{{{\rm{stim}}}}}}}},{{{{{{{\rm{off}}}}}}}}},& i=1\\ {u}_{0}+(1-\mu \frac{c}{100})+{\sigma }_{{{{{{{{\rm{in}}}}}}}},i}(t)&{t}_{{{{{{{{\rm{stim}}}}}}}},{{{{{{{\rm{on}}}}}}}}} \, < \,t \, < \,{t}_{{{{{{{{\rm{stim}}}}}}}},{{{{{{{\rm{off}}}}}}}}},& i=2\\ {u}_{0}+{\sigma }_{{{{{{{{\rm{in}}}}}}}},i}(t)\hfill&{{{{{{{\rm{otherwise}}}}}}}}.\hfill\end{array}\right.$$

(31)

The stimulus period was 21 time steps and ${t}_{{{{{{{{\rm{stim}}}}}}}},{{{{{{{\rm{on}}}}}}}}}$ and ${t}_{{{{{{{{\rm{stim}}}}}}}},{{{{{{{\rm{off}}}}}}}}}$ were uniquely chosen for each trial. The stimulus magnitude μ = 3.2 was fixed and stimulus difficulty was set by c which ranged between − 20 and 20.

The recurrent and input noise are modeled by the elements of ${{{{{{{{\boldsymbol{\sigma }}}}}}}}}_{r}^{{{{{{{{\rm{E}}}}}}}}({{{{{{{\rm{I}}}}}}}})}(t)$ and σ_in(t) that are sampled from a Gaussian distribution. We ensure that each element has a standard deviation σ_0,r and σ_0,in via scaling:

$${\sigma }_{{{{{{{{\rm{r}}}}}}}},i}^{{{{{{{{\rm{E}}}}}}}}({{{{{{{\rm{I}}}}}}}})}(t)=\sqrt{2{\alpha }_{{{{{{{{\rm{r}}}}}}}}}}{\sigma }_{0,{{{{{{{\rm{r}}}}}}}}}{{{{{{{\mathcal{N}}}}}}}}(0,\,1),$$

(32)

$${\sigma }_{{{{{{{{\rm{in}}}}}}}},i}(t)=\sqrt{\frac{2}{{\alpha }_{{{{{{{{\rm{in}}}}}}}}}}}{\sigma }_{0,{{{{{{{\rm{in}}}}}}}}}{{{{{{{\mathcal{N}}}}}}}}(0,\,1).$$

(33)

RNN training

The goal of RNN training is to minimize the difference between the output z (N_trial × N_time × N_out) and targets T (N_trial × N_time × N_out). We set the entries in T to the baseline value of 0.2 and, following a stimulus onset, raise the entries to 1 for the output corresponding to the correct choice. This target is designed to train the network to remain in a low activity state until stimulated and elevate the correct output in response to a stimulus. Half of training trials were catch trials, on which no stimulus was presented and target values remained at 0.2 throughout the trial. The training batch consisted of N_trial = 200 trials which were randomly generated every training epoch. Within the training batch, noncatch trials were equally divided between possible choices and the difficulty was randomly sampled.

Recurrent network weights were randomly initialized from a Gamma distribution with a shape w_μ = 0.0375 and scale w_σ = 0.5 for excitatory weights W^EE, W^EI, and θw_μ and scale w_σ for inhibitory weights W^IE, W^II. The scaling factor θ = N_Es_E/N_Is_I adjusts the strength of inhibitory connections to offset for differences in the number and excitability between excitatory and inhibitory units. Input and output weights Wⁱⁿ, W^out were randomly initialized from a uniform distribution and then values were normalized so the weights associated with each input and output summed to 1 across units. All weights were trained via back-propagation through time to minimize the loss function:

$${{{{{{{\mathcal{L}}}}}}}}= \frac{1}{{N}_{{{{{{{{\rm{trial}}}}}}}}}}\frac{1}{{N}_{{{{{{{{\rm{time}}}}}}}}}}\mathop{\sum }\limits_{i=1}^{{N}_{{{{{{{{\rm{trial}}}}}}}}}}\mathop{\sum }\limits_{t=1}^{{N}_{{{{{{{{\rm{time}}}}}}}}}}\left(\frac{1}{{N}_{{{{{{{{\rm{out}}}}}}}}}}\mathop{\sum }\limits_{o=1}^{{N}_{{{{{{{{\rm{out}}}}}}}}}}{M}_{i,t}{({T}_{i,t,o}-{z}_{i,t,o})}^{2}+\frac{{\lambda }_{x}}{{N}_{e}+{N}_{i}}\mathop{\sum }\limits_{n=1}^{{N}_{e} \!+\!{N}_{i}}{x}_{i,t,n}^{2}\right)\\ +\frac{{\lambda }_{w}}{{({N}_{e}+{N}_{i})}^{2}}\mathop{\sum }\limits_{m,l=1}^{{N}_{e}\!+\!{N}_{i}}\left|{W}_{ml}\right|.$$

(34)

Here x is a concatenation of x_E and x_I of the size N_trial × N_time × (N_E + N_I), and W is a concatenation of W^EE, W^EI, W^IE, and W^II of the size (N_E + N_I) × (N_E + N_I). To encourage the network to integrate the stimulus for extended time, we used a mask M (N_trial × N_time), where entries were zero during the stimulus period so that time points during the stimulus were not considered when calculating the error term of the loss function. On catch trials, all entries of M were set to 1. The hyperparameter λ_x = 0.1 controls the amount of L2 regularization intended to minimize the activation of each unit. The hyperparameter λ_w = 1.0 controls the amount of L1 regularization applied to weights. We updated the weights by stochastic gradient descent using the ADAM optimizer in PyTorch and Python 3.7 with a learning rate 0.01. During training, the norm of the gradient was clipped at 1.

To maintain the identity of excitatory and inhibitory units and to keep the input and output weights positive, all negative elements of W^EE, W^EI, W^IE, W^II, Wⁱⁿ, and W^out were set to 0 after every training step. We prevent self-connections by elementwise multiplying W^EE and W^II by (1 − I), where I is the identity matrix and 1 is a matrix of 1s, after every training step.

We terminated RNN training based on its task performance. We tested RNN performance on a validation batch of trials after every training epoch. Each validation batch consisted of 100 trials with stimulus strength ranging between −20 and 20 in steps of 2. The network registered a decision when the difference between the output variables was above a threshold of 0.25. Trials were considered valid if at least 75% of the prestimulus period was below the decision threshold and at least 50% of the post stimulus period was above the decision threshold. Overall performance was measured as the fraction of correct choices out of all trials except for the ambiguous case where stimulus was equal to 0. We compute the accuracy and the psychometric function only using valid trials. We terminated training when a network’s overall performance reached 85%. RNN parameter values are shown in Table 2.

Table 2 Recurrent neural network parameters

Full size table

Measuring choice selectivity of RNN units

After training, we analyzed the activity of excitatory and inhibitory RNN units to quantify their choice selectivity. Our metric is based on the ability to decode the choice registered by the network based on the activity of the unit at the time point immediately following stimulus offset²¹. For each unit, we computed the receiver operating characteristic (ROC) using the roc function and the area under the ROC curve (AUC_ROC) using the trapz function in Matlab. A unit with the same activity for either choice will have an AUC_ROC equal to 0.5, thus our choice selectivity measure was defined by AUC_ROC − 0.5. To identify significantly selective units, we compared AUC_ROC to a shuffled distribution generated from that unit’s activity by shuffling the choice outcomes 150 times. We considered units to be choice selective if their AUC_ROC fell within the lowest or highest 2.5% percentiles of the shuffled AUC_ROC distribution.

Measuring connection specificity in RNNs

We measured the specificity of connections between choice selective units in RNNs. For each connection class (EE, EI, IE, and II), we computed $\left\langle {w}^{+}\right\rangle$ and $\left\langle {w}^{-}\right\rangle$, the mean strength of the weights between significantly selective units with, respectively, the same and opposite selectivity. Then we computed the specificity γ as:

$$\gamma=\frac{\left\langle {w}^{+}\right\rangle -\left\langle {w}^{-}\right\rangle }{\left\langle {w}^{+}\right\rangle+\left\langle {w}^{-}\right\rangle }.$$

(35)

This expression is identical to the specificity γ used in the mean-field model. To assess significance of correlations between γ for the 4 connection classes, we computed a shuffled distribution constructed by shuffling the network labels 5000 times.

Perturbing inhibitory populations

We perturbed activity of inhibitory neurons by delivering the same constant input to all inhibitory neurons during the stimulus period. In the mean-field model, we modified the parameter ν_0,I by a small amount within the range [−0.5, 0.5] around a baseline. We used two baseline values of ν_0,I: 11.5 for low-inhibitory regime and 14 for high-inhibitory regime. In RNNs, we delivered perturbations in a similar manner, where we delivered a constant input within the range [−1, 1] during the stimulus period.

Four-variable mean-field model

To model the effects of inhibitory-inhibitory specificity and dynamics of inhibitory synapses, we developed a simplified version of our model which explicitly modeled the activity of selective inhibitory populations. In this model, the dynamics of NMDA synapses for excitatory populations E₁ and E₂ (i = 1 and i = 2, respectively) are governed by:

$$\frac{d{S}_{i}}{dt}=-\frac{{S}_{i}}{{\tau }_{{{{{{{{\rm{NMDA}}}}}}}}}}+(1-{S}_{i})\gamma {{\Phi }}({x}_{i}),$$

(36)

and dynamics of GABA synapses for inhibitory populations I₁ and I₂ (i = 3 and i = 4, respectively) are governed by:

$$\frac{d{S}_{i}}{dt}=-\frac{{S}_{i}}{{\tau }_{{{{{{{{\rm{GABA}}}}}}}}}}+{{\Phi }}({x}_{i}).$$

(37)

The nonlinear activation function Φ(x) is of the form Eq. (2) with a = 310 nC⁻¹, b = 125 Hz, and c = 0.16 s for excitatory populations E₁ and E₂, and a = 615 nC⁻¹, b = 177 Hz, and c = 0.087 s for inhibitory populations I₁ and I₂. The input to population i is

$${x}_{i}=\mathop{\sum }\limits_{j=1}^{4}{A}_{i,j}{S}_{j}+{I}_{0,{{{{{{{\rm{E}}}}}}}}({{{{{{{\rm{I}}}}}}}})}+{I}_{{{{{{{{\rm{stim}}}}}}}},i}+{I}_{\nu,i},$$

(38)

where the adjacency matrix A is

$${{{{{{{\bf{A}}}}}}}}=\left(\begin{array}{llll}{w}_{{{{{{{{\rm{EE}}}}}}}}}^{+}\,{J}_{{{{{{{{\rm{NMDA}}}}}}}},{{{{{{{\rm{E}}}}}}}}}&{w}_{{{{{{{{\rm{EE}}}}}}}}}^{-}\,{J}_{{{{{{{{\rm{NMDA}}}}}}}},{{{{{{{\rm{E}}}}}}}}}&{w}_{{{{{{{{\rm{IE}}}}}}}}}^{+}\,{J}_{{{{{{{{\rm{GABA}}}}}}}},{{{{{{{\rm{E}}}}}}}}}&{w}_{{{{{{{{\rm{IE}}}}}}}}}^{-}\,{J}_{{{{{{{{\rm{GABA}}}}}}}},{{{{{{{\rm{E}}}}}}}}}\\ {w}_{{{{{{{{\rm{EE}}}}}}}}}^{-}\,{J}_{{{{{{{{\rm{NMDA}}}}}}}},{{{{{{{\rm{E}}}}}}}}}&{w}_{{{{{{{{\rm{EE}}}}}}}}}^{+}\,{J}_{{{{{{{{\rm{NMDA}}}}}}}},{{{{{{{\rm{E}}}}}}}}}&{w}_{{{{{{{{\rm{IE}}}}}}}}}^{-}\,{J}_{{{{{{{{\rm{GABA}}}}}}}},{{{{{{{\rm{E}}}}}}}}}&{w}_{{{{{{{{\rm{IE}}}}}}}}}^{+}\,{J}_{{{{{{{{\rm{GABA}}}}}}}},{{{{{{{\rm{E}}}}}}}}}\\ {w}_{{{{{{{{\rm{EI}}}}}}}}}^{+}{J}_{{{{{{{{\rm{NMDA}}}}}}}},{{{{{{{\rm{I}}}}}}}}}&{w}_{{{{{{{{\rm{EI}}}}}}}}}^{-}\,{J}_{{{{{{{{\rm{NMDA}}}}}}}},{{{{{{{\rm{I}}}}}}}}}&{w}_{{{{{{{{\rm{II}}}}}}}}}^{+}\,{J}_{{{{{{{{\rm{GABA}}}}}}}},{{{{{{{\rm{I}}}}}}}}}&{w}_{{{{{{{{\rm{II}}}}}}}}}^{-}\,{J}_{{{{{{{{\rm{GABA}}}}}}}},{{{{{{{\rm{I}}}}}}}}}\\ {w}_{{{{{{{{\rm{EI}}}}}}}}}^{-}\,{J}_{{{{{{{{\rm{NMDA}}}}}}}},{{{{{{{\rm{I}}}}}}}}}&{w}_{{{{{{{{\rm{EI}}}}}}}}}^{+}\,{J}_{{{{{{{{\rm{NMDA}}}}}}}},{{{{{{{\rm{I}}}}}}}}}&{w}_{{{{{{{{\rm{II}}}}}}}}}^{-}\,{J}_{{{{{{{{\rm{GABA}}}}}}}},{{{{{{{\rm{I}}}}}}}}}&{w}_{{{{{{{{\rm{II}}}}}}}}}^{+}\,{J}_{{{{{{{{\rm{GABA}}}}}}}},{{{{{{{\rm{I}}}}}}}}}\end{array}\right).$$

(39)

Only excitatory populations (i = 1 and i = 2) receive stimulus information through ${I}_{{{{{{{{\rm{stim}}}}}}}}}$, which is identical to Eq. (4). Noise is introduced by I_ν,i which is implemented as in the two-variable model (Eq. (5)) with the standard deviation of ν(t) set to 0.2 nA.

The weight parameters ${w}_{{{{{{{{\rm{EE}}}}}}}}}^{+}$, ${w}_{{{{{{{{\rm{EE}}}}}}}}}^{-}$, ${w}_{{{{{{{{\rm{EI}}}}}}}}}^{+}$, ${w}_{{{{{{{{\rm{EI}}}}}}}}}^{-}$, ${w}_{{{{{{{{\rm{IE}}}}}}}}}^{+}$, ${w}_{{{{{{{{\rm{IE}}}}}}}}}^{-}$, ${w}_{{{{{{{{\rm{II}}}}}}}}}^{+}$, and ${w}_{{{{{{{{\rm{II}}}}}}}}}^{-}$ were defined as in the two-variable model. The difference is the addition of ${w}_{{{{{{{{\rm{II}}}}}}}}}^{+}$, and ${w}_{{{{{{{{\rm{II}}}}}}}}}^{-}$ which define the specificity of inhibitory-inhibitory connections and depend on γ_II which can range between [−1, 1]. The synaptic parameters J_NMDA,E, J_NMDA,I, J_GABA,E, J_GABA,I, and the background input currents I_0,E(I) were chosen so that the firing rate dynamics of E₁ and E₂ matched that of the two-variable model on a noiseless trial with a stimulus strength of 0.05 using PyABC parameter inference⁵¹. The values of these parameters are defined in Table 3. Simulations of the four-variable model were performed in Python 3.7.

Table 3 Four-variable mean-field model parameters

Full size table

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The data used in this study can be reproduced using the source code.

Code availability

The source code to reproduce the results of this study is available on GitHub (https://github.com/engellab/selective-inhibition-models).

References

Gold, J. I. & Shadlen, M. N. The neural basis of decision making. Annu. Rev. Neurosci. 30, 535–574 (2007).
Article CAS Google Scholar
Wang, X.-J. Probabilistic decision making by slow reverberation in cortical circuits. Neuron 36, 955–968 (2002).
Article CAS Google Scholar
Machens, C. K., Romo, R. & Brody, C. D. Flexible control of mutual inhibition: a neural model of two-interval discrimination. Science 307, 1121–1124 (2005).
Article ADS CAS Google Scholar
Wong, K.-F. & Wang, X.-J. A recurrent network mechanism of time integration in perceptual decisions. J. Neurosci. 26, 1314–1328 (2006).
Article CAS Google Scholar
Moreno-Bote, R., Rinzel, J. & Rubin, N. Noise-induced alternations in an attractor network model of perceptual bistability. J. Neurophysiol. 98, 1125 1139 (2007).
Article Google Scholar
Deco, G. & Rolls, E. T. Decision making and Weber’s law: a neurophysiological model. Eur. J. Neurosci. 24, 901–916 (2006).
Article Google Scholar
Roxin, A. & Ledberg, A. Neurobiological models of two-choice decision making can be reduced to a one-dimensional nonlinear diffusion equation. PLoS Comput. Biol. 4, e1000046 (2008).
Article ADS MathSciNet Google Scholar
Wang, X.-J. Decision making in recurrent neuronal circuits. Neuron 60, 215–234 (2008).
Article CAS Google Scholar
Atiya, N. A. A., Rañó, I., Prasad, G. & Wong-Lin, K. A neural circuit model of decision uncertainty and change-of-mind. Nat. Commun. 10, 2287 (2019).
Article ADS Google Scholar
Lisman, J. E., Fellous, J.-M. & Wang, X.-J. A role for NMDA-receptor channels in working memory. Nat. Neurosci. 1, 273–275 (1998).
Article CAS Google Scholar
Wang, M. et al. NMDA receptors subserve persistent neuronal firing during working memory in dorsolateral prefrontal cortex. Neuron 77, 736–749 (2013).
Article CAS Google Scholar
Isaacson, J. S. & Scanziani, M. How inhibition shapes cortical activity. Neuron 72, 231–243 (2011).
Article CAS Google Scholar
Cardin, J. A., Palmer, L. A. & Contreras, D. Stimulus feature selectivity in excitatory and inhibitory neurons in primary visual cortex. J. Neurosci. 27, 10333–10344 (2007).
Article CAS Google Scholar
Hofer, S. B. et al. Differential connectivity and response dynamics of excitatory and inhibitory neurons in visual cortex. Nat. Neurosci. 14, 1045–1052 (2011).
Article CAS Google Scholar
Packer, A. M. & Yuste, R. Dense, unspecific connectivity of neocortical parvalbumin-positive interneurons: a canonical microcircuit for inhibition? J. Neurosci. 31, 13260–13271 (2011).
Article CAS Google Scholar
Yoshimura, Y. & Callaway, E. M. Fine-scale specificity of cortical networks depends on inhibitory cell type and connectivity. Nat. Neurosci. 8, 1552–1559 (2005).
Article CAS Google Scholar
Znamenskiy, P. et al. Functional selectivity and specific connectivity of inhibitory neurons in primary visual cortex. Preprint at https://www.biorxiv.org/content/10.1101/294835v2 (2018).
Poort, J. et al. Learning and attention increase visual response selectivity through distinct mechanisms. Neuron 110, 686–697.e6 (2021).
Pinto, L. & Dan, Y. Cell-type-specific activity in prefrontal cortex during goal-directed behavior. Neuron 87, 437–450 (2015).
Article CAS Google Scholar
Allen, W. E. et al. Global representations of goal-directed behavior in distinct cell types of mouse neocortex. Neuron 94, 891–907.e6 (2017).
Article ADS Google Scholar
Najafi, F. et al. Excitatory and inhibitory subnetworks are equally selective during decision-making and emerge simultaneously during learning. Neuron 105, 165–179 (2019).
Article Google Scholar
Mahajan, N. R. & Mysore, S. P. Donut-like organization of inhibition underlies categorical neural responses in the midbrain. Nat. Commun. 13, 1680 (2022).
Article ADS CAS Google Scholar
Kim, R. & Sejnowski, T. J. Strong inhibitory signaling underlies stable temporal dynamics and working memory in spiking neural networks. Nat. Neurosci. 24, 129–139 (2021).
Article CAS Google Scholar
Wickelgren, W. A. Speed-accuracy tradeoff and information processing dynamics. Acta Psychol. 41, 67–85 (1977).
Article Google Scholar
Heitz, R. P. & Schall, J. D. Neural mechanisms of speed-accuracy tradeoff. Neuron 76, 616–628 (2012).
Article CAS Google Scholar
Heitz, R. P. The speed-accuracy tradeoff: history, physiology, methodology, and behavior. Front. Neurosci. 8, 150 (2014).
Article Google Scholar
Standage, D., Wang, D.-H. & Blohm, G. Neural dynamics implement a flexible decision bound with a fixed firing rate for choice: a model-based hypothesis. Front. Neurosci. 8, 318 (2014).
Article Google Scholar
Song, H. F., Yang, G. R. & Wang, X.-J. Training excitatory-inhibitory recurrent neural networks for cognitive tasks: a simple and flexible framework. PLoS Comput. Biol. 12, e1004792 (2016).
Article ADS Google Scholar
Mazurek, M. E., Roitman, J. D., Ditterich, J. & Shadlen, M. N. A role for neural integrators in perceptual decision making. Cereb. Cortex 13, 1257–1269 (2003).
Article Google Scholar
Lim, S. & Goldman, M. S. Balanced cortical microcircuitry for maintaining information in working memory. Nat. Neurosci. 16, 1306–1314 (2013).
Article CAS Google Scholar
Niyogi, R. K. & Wong-Lin, K. Dynamic excitatory and inhibitory gain modulation can produce flexible, robust and optimal decision-making. PLoS Computational Biol. 9, e1003099 (2013).
Article ADS MathSciNet CAS Google Scholar
Eckhoff, P., Wong-Lin, K. F. & Holmes, P. Optimality and robustness of a biophysical decision-making model under norepinephrine modulation. J. Neurosci. 29, 4301–4311 (2009).
Article CAS Google Scholar
Sussillo, D. Neural circuits as computational dynamical systems. Curr. Opin. Neurobiol. 25, 156–163 (2014).
Article CAS Google Scholar
Eccles, J. C., Fatt, P. & Koketsu, K. Cholinergic and inhibitory synapses in a pathway from motor axon collaterals to motoneurones. J. Physiol. 126, 524–562 (1954).
Article CAS Google Scholar
Kim, R., Li, Y. & Sejnowski, T. J. Simple framework for constructing functional spiking recurrent neural networks. Proc. Natl Acad. Sci. USA 116, 22811–22820 (2019).
Article ADS CAS Google Scholar
Ma, W.-p et al. Visual representations by cortical somatostatin inhibitory neurons—selective but with weak and delayed responses. J. Neurosci. 30, 14371–14379 (2010).
Article CAS Google Scholar
Moore, A. K. & Wehr, M. Parvalbumin-expressing inhibitory interneurons in auditory cortex are well-tuned for frequency. J. Neurosci. 33, 13713–13723 (2013).
Article CAS Google Scholar
Lee, S.-H. et al. Activation of specific interneurons improves V1 feature selectivity and visual perception. Nature 488, 379–383 (2012).
Article ADS CAS Google Scholar
Sederberg, A. & Nemenman, I. Randomly connected networks generate emergent selectivity and predict decoding properties of large populations of neurons. PLoS Comput. Biol. 15, e1007875 (2019).
Google Scholar
Soma, S., Shimegi, S., Osaki, H. & Sato, H. Cholinergic modulation of response gain in the primary visual cortex of the macaque. J. Neurophysiol. 107, 283–291 (2012).
Article CAS Google Scholar
Soma, S., Shimegi, S., Suematsu, N. & Sato, H. Cholinergic modulation of response gain in the rat primary visual cortex. Sci. Rep. 3, 1138 (2013).
Article ADS Google Scholar
Soma, S., Shimegi, S., Suematsu, N., Tamura, H. & Sato, H. Modulation-specific and laminar-dependent effects of acetylcholine on visual responses in the rat primary visual cortex. PLoS ONE 8, e68430 (2013).
Article ADS CAS Google Scholar
Salgado, H. et al. Muscarinic M2 and M1 receptors reduce GABA release by Ca2+ channel modulation through activation of PI3K/Ca2+ -independent and PLC/Ca2+ -dependent PKC. J. Neurophysiol. 98, 952–965 (2007).
Article CAS Google Scholar
Liu, B., Lo, C.-C. & Wu, K.-A. Choose carefully, act quickly: efficient decision making with selective inhibition in attractor neural networks. Preprint at https://www.biorxiv.org/content/10.1101/2021.10.05.463257v2.full (2021).
Lam, N. H. et al. Effects of altered excitation-inhibition balance on decision making in a cortical circuit model. J. Neurosci. 42, 1035–1053 (2022).
Churchland, A. K., Kiani, R. & Shadlen, M. N. Decision-making with multiple alternatives. Nat. Neurosci. 11, 693–702 (2008).
Article CAS Google Scholar
Cisek, P., Puskas, G. A. & El-Murr, S. Decisions in changing conditions: the urgency-gating model. J. Neurosci. 29, 11560–11571 (2009).
Article CAS Google Scholar
Thura, D., Beauregard-Racine, J., Fradet, C.-W. & Cisek, P. Decision making by urgency gating: theory and experimental support. J. Neurophysiol. 108, 2912–2930 (2012).
Article Google Scholar
Carland, M. A., Thura, D. & Cisek, P. The urgency-gating model can explain the effects of early evidence. Psychonomic Bull. Rev. 22, 1830–1838 (2015).
Article Google Scholar
Finkelstein, A. et al. Attractor dynamics gate cortical information flow during decision-making. Nat. Neurosci. 24, 843–850 (2021).
Article CAS Google Scholar
Schälte, Y. et al. pyABC: Efficient and robust easy-to-use approximate Bayesian computation. J. Open Source Softw. 7, 4304 (2022).
Article ADS Google Scholar

Download references

Acknowledgements

This work was supported by the NIH grants F32MH123011 (J.P.R.), R01 EB026949 (A.K.C. and T.A.E.), and 2R01EY022979 (A.K.C.), Alfred P. Sloan Foundation Research Fellowship (T.A.E.), and the ISQEB program at the Simons Center for Quantitative Biology at CSHL (J.P.R.). Computer simulations for this work were performed with assistance from the NIH Grant S10OD028632-01. We thank M. Genkin for thoughtful comments on the manuscript.

Author information

Authors and Affiliations

Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
James P. Roach & Tatiana A. Engel
Department of Neurobiology, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA
James P. Roach & Anne K. Churchland

Authors

James P. Roach
View author publications
You can also search for this author in PubMed Google Scholar
Anne K. Churchland
View author publications
You can also search for this author in PubMed Google Scholar
Tatiana A. Engel
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.P.R., A.K.C and T.A.E. designed the research. J.P.R. developed the code and performed computer simulations. J.P.R., A.K.C and T.A.E. wrote the paper.

Corresponding author

Correspondence to Tatiana A. Engel.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Jorge Mejias and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Roach, J.P., Churchland, A.K. & Engel, T.A. Choice selective inhibition drives stability and competition in decision circuits. Nat Commun 14, 147 (2023). https://doi.org/10.1038/s41467-023-35822-8

Download citation

Received: 16 February 2022
Accepted: 03 January 2023
Published: 10 January 2023
DOI: https://doi.org/10.1038/s41467-023-35822-8

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Computational complexity drives sustained deliberation

Initial conditions combine with sensory evidence to induce decision-related dynamics in premotor cortex

Adaptive circuit dynamics across human cortex during evidence accumulation in changing environments

Introduction

Results

Inhibitory connection specificity expands the space of circuits that support decision making

Inhibitory motif controls the speed versus accuracy trade-off

Strong ipsispecific inhibition destabilizes working memory

Inhibitory choice selectivity in trained recurrent neural networks

Excitatory specificity aligns with ispi- and contraspecific inhibitory motifs in RNNs

Perturbing inhibitory neuron activity reveals regimes where stabilizing and competitive inhibition dominate

Discussion

Methods

Mean-field model

Circuit structure

Evaluation of circuit performance

Phase plane and bifurcation analysis

Recurrent neural network models

RNN training

Measuring choice selectivity of RNN units

Measuring connection specificity in RNNs

Perturbing inhibitory populations

Four-variable mean-field model

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links