Fluid network dynamics in the prefrontal cortex during multiple strategy switching

Malagon-Vina, Hugo; Ciocchi, Stephane; Passecker, Johannes; Dorffner, Georg; Klausberger, Thomas

doi:10.1038/s41467-017-02764-x

Download PDF

Article
Open access
Published: 22 January 2018

Fluid network dynamics in the prefrontal cortex during multiple strategy switching

Hugo Malagon-Vina¹,
Stephane Ciocchi¹^nAff3,
Johannes Passecker ORCID: orcid.org/0000-0002-4366-2691¹,
Georg Dorffner² &
…
Thomas Klausberger¹

Nature Communications volume 9, Article number: 309 (2018) Cite this article

6610 Accesses
27 Citations
124 Altmetric
Metrics details

Subjects

Abstract

Coordinated shifts of neuronal activity in the prefrontal cortex are associated with strategy adaptations in behavioural tasks, when animals switch from following one rule to another. However, network dynamics related to multiple-rule changes are scarcely known. We show how firing rates of individual neurons in the prelimbic and cingulate cortex correlate with the performance of rats trained to change their navigation multiple times according to allocentric and egocentric strategies. The concerted population activity exhibits a stable firing during the performance of one rule but shifted to another neuronal firing state when a new rule is learnt. Interestingly, when the same rule is presented a second time within the same session, neuronal firing does not revert back to the original neuronal firing state, but a new activity-state is formed. Our data indicate that neuronal firing of prefrontal cortical neurons represents changes in strategy and task-performance rather than specific strategies or rules.

A persistent prefrontal reference frame across time and task rules

Article Open access 08 March 2024

Phase of firing coding of learning variables across the fronto-striatal network during feature-based learning

Article Open access 16 September 2020

Long-range inhibition synchronizes and updates prefrontal task activity

Article Open access 26 April 2023

Introduction

In ancient Greek, Heraclitus famously stated that “No man ever steps in the same river twice, for it’s not the same river and he’s not the same man”. He referred to the ambiguity that conscious actions and plans are never truly experienced the same way, however similar they may appear. Earlier research demonstrated how distinct behavioural rules and strategies are entailed in the activity of prefrontal neurons^1,2,3. We aim to address the question how a certain behavioural strategy, applied on two different occasions within the same session, is represented in the neuronal firing rate of the prefrontal cortex. The prefrontal cortex is a central structure for executive control of flexible behaviour to assess new rules and strategies, not only in humans^4,5 but also in monkeys^6,7 and rodents^8,9,10, and is highly interconnected with other brain regions^11,12 indicative of an integrative structure¹³ and a multifunctional role during cognitive tasks¹⁴. The importance of the prefrontal cortex is highlighted by neuronal firing patterns contributing to error-related activity^15,16,17, working memory^{18,19,20,21,22,23}, decision making^{24,25,26,27,28}, and reward encoding^{29,30,31,32,33}. It has been shown that lesions of the medial prefrontal cortex lead to an impairment of the ability to follow changing spatial rules³⁴. Prefrontal neurons also change their firing activity when animals are switching between different strategies¹. This indicates the importance of the prefrontal cortex for rule-guided behaviour. In addition, behavioural rule changes can lead to abrupt neuronal population activity changes within a very short period of time². Furthermore, abrupt and coordinated changes of neuronal firing have been reported when animal behaviour reflects uncertainty and evaluation of possible new strategies³. However, the following questions remain unclear: (1) how multiple and consecutive changes of strategy would be reflected in the firing of prefrontal neurons and (2) how the same repeated strategy would be represented on different occasions.

To address neuronal computations in consecutive rule presentations, we used single-unit recordings in freely moving rats to assess neuronal activity during a prefrontal cortex-dependent rule-switching task¹⁰. In our task design, animals managed to perform under multiple rules during a single recording session, which allowed us to study the neuronal representations when animals changed new rules and acquired new strategies. Additionally, we examined how neuronal population states are changing during the presentation and repetition of multiple rules in the course of a single session.

The results of this study show that the neuronal population forms a different and stable firing-state every time a new rule is learnt, even when the same rule is presented twice during the same session. This implies that the concerted neuronal population in the prelimbic cortex does not represent individual rules permanently, but it reflects a change of strategies.

Results

A strategy-switching task with multiple rule changes

Rats (n = 5) performed a strategy-switching task with multiple rule changes (from 1 up to 6 changes, median = 3) within each behavioural session (Fig. 1a), while the activity of neurons in the prefrontal cortex was extracellularly recorded with 12 tetrodes (Fig. 1b). Rats were seeking a food reward on a plus-maze using one out of four possible strategies based on two allocentric (landmark-referenced) and two egocentric (self-referenced) rules. The animals were placed at one of the two possible starting arms (North or South arm) and they had to decide to run towards one of the two goal arms (East or West arm), while the arm opposite of the starting position was blocked. After reaching the end of the goal arm, a reward was given for a correct choice according to the current rule. Then the animal was manually positioned into a bin at the centre of the maze to break stereotyped behaviour. After 3–7 s, the rat was placed again at one of the starting arms to begin the next trial. When an animal successfully succeeded in performing 13 out 15 consecutive trials, the rule was changed without notice and the animal had to switch strategy in order to maximise reward based on trial and error information. We analysed the performance of the animal using the behavioural choices (correct and incorrect) using a Markov-chain Monte–Carlo analysis³⁵ (Fig. 1c), which defines the probability of the rat being correct during each trial together with the associated confidence intervals. Those intervals are used to determine learning periods (see Methods). Only one recording session was carried out on a single day.

We performed behavioural control experiments in two rats to assess whether animals indeed used landmarks during allocentric but not egocentric strategies. The maze was surrounded by four walls, which displayed distinct and large landmarks for the orientation of the rats during task navigation. The availability of landmark cues during allocentric and egocentric strategies was controlled by keeping or removing all landmarks for 10 trials, following the successful learning of a rule. When animals were allowed to use the visual landmarks during the allocentric rules, they performed significantly better than without the availability of landmarks (p = 0.0004). However, during egocentric rules, the removal or addition of landmarks did not alter performance (p = 0.53) (Fig. 1d). This indicates that rats, indeed, followed allocentric or egocentric strategies during the respective rules to maximise their reward.

Neuronal firing correlates with task-performance

A total of 300 neurons were recorded with tetrodes in the prefrontal cortex in three animals during the performance of the strategy-switching task. For the recording sessions, we determined the trial-by-trial time-series of neuronal firing rates for the entire trial, by diving the number of spikes of a neuron by the time of the trial. To test the consistency of our findings, we divided each trial into 3 non-overlapping behavioural trial segments defined as run, reward and inter-trial period (see Methods). By examining the firing rate of individual neurons during consecutive trials and different behavioural segments, we observed the following two groups of neurons (Fig. 2a, b): (1) positively correlated neurons that had an increased firing rate during behavioural periods with good performance, and (2) negatively correlated neurons, which had an increased firing rate during periods with low performance, when negative feedback in form of repeated lack of reward with a conflicting understanding of the task was experienced. Out of 300 recorded units, we identified neurons (n = 95, 54, 74 and 84 for the entire trial, run, reward and inter-trial period, respectively) with either significant negative or positive correlations between their firing rate and task performance (Fig. 2c, Spearman's correlation, α = 0.05, Bonferroni–Holm correction; the data for different animals are shown in Supplementary Fig. 1). The cumulative distribution function of the correlations (n = 300 neurons) in different trial segments (Fig. 2d) indicate that subsets of prefrontal neurons exhibit either a positively or negatively correlation of firing rate with task performance of the animal. To demonstrate that the correlations observed between firing rate and task performance are rather a product of the integration of outcomes over time rather than reflecting an instantaneously received reward or lack thereof, we shuffled the order of trials while keeping the firing rate and correct or incorrect performance associated with each trial, and generate a new shuffled performance curve. Then, we tested for possible correlations between firing rate and shuffled performance and observed that the correlations obtained from shuffled trials were significantly lower than those derived from observed data for all episodes as well as for the entire trial time (Fig. 2e, difference tested with Kolmogorov–Smirnov test). This indicates that correlations between firing rates of neurons and performance of the animal correspond to the integration of previously experienced trials and not to the instantaneous response to the reward obtained in each trial.

As fast-spiking GABAergic interneurons might be important for computations involving negative feedback and conflicting results^36,37,38, we divided neurons into two groups: one group of neurons with firing rate higher or equal to 10 Hz, in which an enriched—but not exclusive—population of fast-spiking interneurons is expected, and another group of neurons with firing rates lower than 10 Hz, in which an enriched—but not exclusive—population of pyramidal cells is expected (Fig. 3a). When compared with low-firing neurons, a larger fraction of high-firing neurons had significantly correlated firing rates with task performance, (Fig. 3b, p = 0.024, chi-square test) and also displayed an enriched tendency to be negatively correlated (Fig. 3b, p = 0.014, Χ² test).

Prefrontal neurons reflect changes in rules and strategies

The neuronal state representation in the prefrontal cortex changes during rule switches². However, the neuronal state dynamics remain unclear when multiple rule changes are presented in the same recording session. As the activity of the prefrontal cells reflects behavioural performance, we tested changes in population activity during multiple rules presentations in a single recording session. For this, each trial was represented as a population vector of neuronal activity: $T = FR_1,FR_2,...,FR_n$, where $FR_n$ is the firing rate of neuron ‘n’ during trial T. For this analysis, we only used trials during learning and learnt phases of the presented rules. We excluded naive phases because they consist of mainly persistent behaviour of the animal still following the previous rule (Fig. 4a). We observed that the N-dimensional population vectors of neuronal activity belonging to one rule tended to be clustered, at least when dimensionality reduction using a principal component analyses was applied (Fig. 4b). To test this quantitatively and without dimensionality reduction, we first applied a K-means clustering algorithm (see Methods section) to the N-dimensional representation, specifying the number of clusters as the number of rules that should be found. The algorithm assigned each of the trials to a specific “rule cluster”. We compared this assignment with the actual rules during the respective trial and created an accuracy index (number of correctly grouped trials over the total number of trials). We tested the null hypothesis that population vectors, defined by the firing rates, do not cluster according to rules using a permutation test. We shuffled the order of trials keeping the firing rate associated with each trial, applied the K-means algorithm and recomputed the accuracy index. The accuracy of the K-means algorithm is significantly higher for the observed data than for the shuffled data, demonstrating that there is a clustered organisation of the trials (Fig. 4c, p = 3e−5, Wilcoxon signed-rank test and Supplementary Fig. 2a for data from individual animals). We could confirm that the centre of mass of the N-dimensional cloud belonging to a rule can be used as a representation of the clustered rule (Fig. 4d and Supplementary Fig. 2b). Clustering using Mahalanobis distance also produce a significant difference between observed and shuffled data (Supplementary Fig. 3a, p = 1.07e−04, Wilcoxon signed-rank test). We found a significant difference (p = 1.8e-5, Wilcoxon signed-rank test) between observed and shuffled data, when the clusters were defined by the centre of mass. Overall, due to the observation that N-dimensional population vectors of neuronal activity can be clustered, this implies that neuronal activity of the prefrontal cortex holds some form of a representation of rules.

Nonetheless, it has been shown previously that representations in the prefrontal cortex are drifting over time³⁹, which may account for the clustered organisation of our data across consecutive trials and rules. To address this possibility, we performed a multiple linear regression in which we explained the distance between the N-dimensional centres of two clustered rules either by the number of trials (as a measure of time) or by the number of rules separating them. The distance between two rules was calculated by measuring the Euclidean distance between the N-dimensional centres of mass of both clusters. The resulting plane of the regression showed a significant explanation of the distance due to the number of rules in between (p = 2.81e−08) but not due to the trials in between (p = 0.78, Fig. 4e). Supplementary Table 1 shows the p-values segregated per animal. The interaction between both factors was not significant (p = 0.704). To further elucidate the contribution of both time and rules to the distance between the rules, two partial correlations were calculated (Fig. 4f). The partial correlation of the distance between two rules and the number of trials in-between is not significant (p = 0.779, Spearman correlation) when the number of rules in-between is taken into account. On the contrary, the partial correlation of the distance of two rules and the number of rules in-between is significant (p = 2.8e-8, Spearman correlation), even when the number of trials in-between is taken into account. Moreover, when Mahalanobis distance was used instead of Euclidean, similar results were obtained. The Mahalanobis distance between clusters is significantly explained by the number of rules (p = 4.756e−05) but not by the number of trials in between the clusters (p = 0.76405). In addition, the interaction term is not significant (p = 0.646). The partial correlations, using Mahalanobis distances, follow the same tendency (Supplementary Fig. 3b, p = 4.756e-05 (top), p = 0.764 (bottom), Spearman correlation). Overall, this data indicates that the distance of firing vectors between rule clusters is better explained by the presentation of rules per se, rather than by time that has passed between rules, suggesting that rule-dependent switches in strategy may drive the population activity rather than just the passing of time.

Repeated rules do not induct the same firing state

Having discovered a clustering of neural firing patterns in the prefrontal cortex based on rules, this leads to the question whether those firing patterns could reflect persistent representations of a specific rule. To address this question, we presented the animals with the same rule twice within a single recording session (Fig. 5a, b). Strikingly, after visualising the data projected onto the two first principal components, we observed that when the animal faced the same rule a second time later in the session, the neuronal population formed a new state instead of returning to the one initially formed when the rule was presented for the first time (Fig. 5c, d, e, f). To further corroborate that the neuronal population response of the repetition ‘A°’ of a given rule ‘A’ is distinct, we trained two different classifiers: a logistic regression and a support vector machine. The data were divided into a training set and a test set. The training set consisted of 70% of the trials corresponding to the rule ‘A’ and it was labelled as ‘1’, and the trials of all other rules (except the repetition ‘A°’), which were labelled as ‘0’. The test set included the repetition ‘A°’ and the remaining 30% of the trials in ‘A’. After building the decoder with the training set, an accuracy value of belonging to the rule ‘A’ is computed for the test set (number of trials classified as ‘1’ over the total number of trials). The classifiers correctly assigned the data belonging to rule ‘A’ as ‘1’ (belonging to rule ‘A’), while the data of rule ‘A°’ is assigned as ‘0’, indicating a different rule form ‘A’ (Fig. 5g, SVM − > p < 1e−20, Logistic regression − > p < 1e−20). These analyses suggest that the neuronal firing state during a rule repetition is different from the firing during the initial rule.

Interestingly, both the Euclidean and Mahalanobis distance between two repeated rules are not significantly different from the distance between two different rule clusters with the same number of rules in between (Fig. 5h, p = 0.58, Wilcoxon rank-sum test and Supplementary Fig. 3c, p = 0.248, Wilcoxon rank-sum test), indicating that the repetition of the rule is perceived by the prefrontal cortex’s neuronal population in a similar way as if another new rule was presented.

These results strongly advocate that cognitive rules are not specifically represented by unique neuronal firing in the prelimbic and cingulate cortex, but rather by a new formation of neuronal firing states, even during the repetition of rules during the same session.

Speed and trajectory do not explain firing states

It is known that firing in the prefrontal cortex neurons reflects animal trajectories and movement⁴⁰. Thus, it might be possible that the different neuronal firing states during a rule and during its repetition might be due to the change in trajectories or running speed which might variate between the beginning of the recording session and the end. However, trajectories of the animal do not seem to variate much from one rule to its repetition (Fig. 6a, Supplementary Fig. 4). Nonetheless, in order to account for the trajectories and include them in our analyses, we decided to take advantage of the maze’s geometry and fit the trajectories to a quadratic equation: $ax^2 + bx + c$. The three coefficients a, b, c describe the trajectory of an animal in a given trial, and we could extend our analyses to account for movement (Fig. 6b). Differences between trajectories of trials were quantified by the three coefficients of the quadratic fit (Fig. 6c). We compared the distances of trials which have very similar trajectories (lower than 5th percentile of the similarity index distribution) between the first presentation of a rule ‘A’ and its repetition ‘A°’ vs the distances of trials, which have very different trajectories (higher than the 95th percentile of the similarity index distribution) within the rule ‘A’ (Fig. 6d). Even for trials with very similar trajectories between ‘A’ and ‘A°’, the Euclidean distances of firing states are significantly larger (p = 0.0017, Wilcoxon rank-sum test) than those between trials only within ‘A’ with very different trajectories. This implies that the trajectories per se do not explain the separated clustering of the rule repetition.

However, trajectories and speed still might have a more subtle effect. Therefore, we modelled the firing rate of a neuron by using the coefficients of the quadratic equation and the speed of the animal as follows: ${\rm{FR}}_T = \beta _0 + a_T\beta _1 + b_T\beta _2 + c_T\beta _2 + s_T\beta _3$ where FR is the firing rate; a, b, c are the coefficients of the quadratic equation and ‘s’ is the speed in m/s in a given trial ‘T ’.

As expected, close to a 30% of the neurons showed a significant correlation to at least one of the coefficients in each of the 4 possible paths (Supplementary Table 2a, b). After having a quantitative value of the trajectory contribution to the firing rate, we used the residuals of the model (the part of the firing rate that it is not explained by the movement variables) to re-do all the analyses presented in this paper. When using the residuals of the model, similar correlation values between the firing rate and the performance of the animal are maintained for all the neuronal population, as well as for the high firing rate neurons (Fig. 6e, f, r = 0.9123, p < 1 × 10⁻²⁰ and r = 0.9326, p < 1e−20, Spearman correlation).

Moreover, projections of the residuals on the first two principal components of the multi-unit activity, still remain clustered (Fig. 6g) and rule repetitions are found to be still in another cluster different from the first presentation (Fig. 6h). The results of the accuracy of the K-means clustering algorithm applied to the residuals of the firing rates still show a clustered organisation of the rules in the network state (Fig. 6i). The same general linear model previously described and shown in Fig. 4e was now fitted with the residuals data, reaching similar results. The number of rules in between is responsible for the explanation of the cluster’s distance (p = 0.00053) and not the number of trials in between (p = 0.97). In addition, there is not a significant difference between the distances from data of non-repeated and repeated rules (Fig. 6j, p = 0.3944, Wilcoxon rank-sum test).

Overall, these results support that both the neuronal representations of cognitive rules and the new formation of a neuronal firing state upon the repetition of a rule are not an artefact of the movement of the animals.

Discussion

To investigate the computations of neuronal populations in the prefrontal cortex during flexible behaviour, we recorded neuronal activity while animals were performing a prefrontal cortex-dependent strategy-switching task¹. We managed to assess multiple rule changes within the same session, which allowed the possibility to study neuronal population dynamics while following several behavioural changes in strategy. We found two complementary neuronal groups, which dynamically changed their firing during behaviour and their firing rates were significantly correlated with performance. In addition, when multiple rules were presented on the same day, neuronal populations formed new neural states for each rule, even when the same rule was presented twice.

Neurons in the prefrontal cortex have a multitude of different firing patterns, which reflect many aspects of the external environment as well as internal computations, including goal-related firing^21,26,27,41, reward^{29,31,32,42,43,44}, encoding of memory and executive functions^{18,20,41,45,46} or confidence²⁴. But how do diverse firing patterns adapt when the strategy of an animal changes? Lesions^5,34,47,48, optogenetic inactivation⁴⁹ and pharmacological inactivation¹⁰ of the prefrontal cortex have been reported to induce impairment in cognitive flexibility. In the prelimbic cortex, changes in firing rates of single neurons^1,50 and neuronal populations^2,51 have been observed in relation to flexibility in strategy during a rule switching task. Often those changes are presented as a global change of activity and might relate to a complete switch to a different network state, as shown for synchronised changes of neuronal populations during an uncertainty task³. In fact, after lesioning the medial prefrontal cortex, animals could not follow a change of spatial related rules³⁴.

We assessed how changes in network activity are related to the performance of the animal during multiple rule changes that are presented within the same session, reflecting a high demand on cognitive flexibility. In order to perform a goal-guided behaviour, often the right choice between stability and flexibility has to be found. Thus, the prefrontal network operations should be flexible enough to take into account different sensory inputs and experiences, which provide evidence for a different and more successful strategy, but at the same time should remain stable enough to ignore irrelevant information⁵². This may contribute to the reason why neuronal ensembles in the prefrontal cortex often present abrupt changes in activity whenever a new strategy is behaviourally required², denoting a flexible state. During the subsequent learning of the new rule, the representation of neuronal responses tends to be more stable and less sensitive to noise⁵³. We show that firing rates of individual neurons are correlated with the behavioural performance of the rat during a rule switching task. We observed two groups of neuron with a different performance-related activity. For one group of neurons, the firing rate was negatively correlated with the performance of the animal, while for a second, complementary group of neuron, there was a positive correlation. Additionally, we also show that these correlations can be found without taking into account the firing information during reward, demonstrating that it is not only a reward-monitoring value of performance. These complementary neuronal assemblies with opposite correlations between firing rate and performance might reflect the simultaneous ability of the network to maintain rewarding behaviours and perform the task with high success or to induce flexible changes when the success rate is low.

We investigated neuronal population activity by representing each trial as a population vector constructed with the individual firing rates of simultaneously recorded neurons. We defined a specific population vector per trial and a “rule cluster” defined by the combination of points of all the trials belonging to a specific rule. Similar to Durstewitz et al², we show that vector populations of trials belonging to the same rule tended to be clustered together, in contrast to vector populations with an intrinsic random organisation. However, by examining the neuronal population dynamics using population vectors, we extended the rule-change analyses to multiple rule changes presented during the same recording session, including in some cases the repeated presentation of the same rule. Not only was each rule reflected by an independent state representation or “rule cluster” different from firing during other rules, but when the same rule was presented again later in that session, the second presentation fell into a new network state, different from the first presentation of this particular rule. Our results suggest that identical rules in a prefrontal cortex-dependent task will not lead to a similar neuronal encoding, but rather to a new set of prefrontal network activity. In line with Heraclitus’s quote, we provide evidence that the same action plan is not “perceived” identically a second time, as the new experiences shaped a new cognitive state. One explanation for this phenomenon might be that firing rates of the prefrontal cortex are context dependent^{11,39,54,55,56,57} and could be influenced not only by past events but also by multiple inputs that the prefrontal cortex is receiving from other areas^11,12,58. This implies that even when the rule is the same in the second presentation, the actual context in which the rule has been presented differs from the previous presentation. Indeed, due to the fact that time has been shown to be responsible for state changes or drifts in the neuronal activity of the prefrontal cortex³⁹, we decided to test the contribution of time for the new encoding of the repeated rule. Interestingly, when corrected by the number of rules appearing, time was no longer explanatory for the drift (Fig. 4f), but rather the number of multiple rules or rule changes. This leads to the conclusion that it is rather the experience of a new rule and the accompanied strategy switch that contribute to the appearing of new firing state rather than a monotonous drift of the neuronal activity over time. As a second possibility, reservoir neuronal networks related to prefrontal cortex models have been proposed and imply a non-stationary response of neurons during performance of a task^59,60, when neuronal activity is temporarily lifted from a random firing to a stable representation initiated by the requirements of the task. Such periods of modulation might depend on the capability of the network to decode information⁶¹. This would imply that the state into which the network is being driven depends on stochastic processes occurring when the network is not being recruited, bringing a different state representation to the same rule. However, time-dependent processes in the prefrontal cortex are not totally excluded by the reservoir networks, as it has been previously shown that a dynamic reservoir networks in the prefrontal cortex can maintain several distinct timescales of reward memory, thus incorporating previous information and facilitating flexible changes and adaptive reinforcement learning⁶².

In conclusion, we show that individual neuronal firing rates correlate with the fluctuating performance of the rat during changes in strategy. The observed assemblies of those neurons portray the interaction of two groups of opposite but complementary patterns forming stable neuronal populations, which are being dynamically recruited for the purpose of flexible cognitive behaviour. Interestingly, when these populations formed different states for each rule presented, rules were not specifically encoded by a defined population vector as the same rule has two different representations when presented twice during the same session. Our analyses indicate that these observations are independent of the movement of the animals. This evidently advocates that neuronal firing in the prelimbic cortex reflects changes in strategy and task-performance monitoring but does not represent long-term strategies or rules permanently.

Methods

Experimental animals

Five long Evans rats from Charles River Laboratories (male, 300–600 g), were kept in 12 h light cycle during behavioural experiments (performed during light cycle). All experimental procedures were performed under an approved licence of the Austrian Ministry of Science and the Medical University of Vienna.

Surgery and microdrive implantation

Animals were anaesthetised with Isoflurane (induction 4%, maintenance 1–2%; oxygen flow 2 l/min) and fixed on a stereotaxic frame, where body temperature was stabilised using a heating pad. Iodine solution was applied to disinfect the surgery site and eye cream was used to protect the corneas. Local anesthetic (xylocain^® 2%) was used before the incision. In order to avoid dehydration, saline solution was injected subcutaneously every 2 h. Seven stainless steel screws were anchored into the skull to improve the stability of the construct and two of the screws were placed onto the cerebellum as references for the electrophysiological recordings. Subsequently, based on the rat brain atlas⁶³, a craniotomy was performed above the prefrontal cortex area (from bregma: + 3.25 mm anterior, 1 mm lateral, right hemisphere), where, after removal of the dura mater, an array of 12 independently movable, gold plated (100–500 kΩ) wire tetrodes (13 µm insulated tungsten wires, California Fine Wire, Grover Beach, CA) mounted in a microdrive (VersaDrive, State University of New York Downstate Medical Center) were implanted. Then, paraffin wax was applied around the tetrode array and the lower part of the microdrive was cemented (Refobacin® Bone Cement) to the scalp. At the end, the surgery site was sutured and systemic analgesia (metacam^® 2 mg/ml, 0.5 ml/kg) was given. Animals were given post-operative analgesia (Dipidolor 60 mg diluted per 500 ml drinking water) and were allowed at least 7 days of recovery time.

Maze description and behaviour

A strategy switching task based on the consecutive learning of allocentric and egocentric spatial navigation rules¹ was used as a behavioural paradigm. The paradigm was modified to train the animal to perform multiple rule changes over the same session, by changing the rule unannounced every time the criterion for rule learning was achieved (13 out of 15 correct choices). The maze consisted of a plus-maze (55 cm high, made of wood) composed of four arms (80 × 11 cm) with a 90° angle separation between them. Two opposite arms (named North and South) were starting arms and the other two (named East and West) were rewarded arms. The maze was located within four synthetic wall panels to maintain the uniformity of the environment. Three landmarks (triangle, circle, and square made of polystyrene) were visible and attached to different panels. Reward (sucrose pellets, 3 × 20 mg, TestDiet) was delivered by dispensers (Campden Instruments Ltd). The entire automation of the maze (sensors and pellet feeder controls) was controlled by self-made scripts in MATLAB. Tracking of the rats’ movement was monitored by triangulating the signal from three LEDs (red, blue, green) placed on the implanted headstage and recorded at 25 frames per second by an overhead video camera (Sony). One week after surgery, habituation to the maze started. Rats were food-restricted to maintain 85% of their weight. First, during two consecutive days, animals were placed inside of the recording room for a period of around 1–3 h. From the third day on, rats were placed in a ceramic bin at the centre of the maze for an hour and were later positioned on the maze in order to start exploration. Some sugar pellets were placed on the maze and several more at the reward zone. Once the animals learned to explore the maze and consumed the sugar pellets (usually 1–2 days), they were habituated to the activation sound of the pellet feeders, by only giving reward once they entered the reward zone and activated the pellet feeder. After that, the training phase started. It consisted of manually placing the animals at one of the two possible starting points; in this case, the rats had to reach any of the two reward zones, where the reward was given indistinctly. After reward was consumed, rats were manually placed in the bin located at the centre of the maze. Once 30–50 trials were successfully performed by the rat, the full strategy switching task was presented on all subsequent sessions with either rewarded or non-rewarded trials, depending on the actual rule and decision. A typical recording day took place as follows: The rat was kept for 10 min inside of the ceramic bin at the centre of the maze, where baseline recordings were made. Then the task started and the initial rule to be presented was given pseudo-randomly by the script used to control the maze and the task. The animal was taken from the bin and located at one of the two possible starting positions (North or South pseudo-randomly selected) and the bin, where the animal was before was placed to block the opposite starting arm. The rat then ran towards one of the two possible goal arms. After crossing a sensor located at the end of the arm, reward was given in case of being correct. Independently of the correctness of the trial, the animal was taken 2 or 3 s later, placed again in the ceramic bin at the centre of the maze, and after 3–7 s the rat was placed again at one of the two possible starting arms and a new trial was started. The same starting arm could not be selected more than four consecutive times. During the task, when an animal achieved 13 out of 15 possible correct trials, the rule was changed without warning. Four different rules were presented in the task (Fig. 1a): two are allocentric rules (landmark-referenced) during which, independently of the starting point, animals have to go to a reward arm, and the other two rules are egocentric rules (self-referenced) during which, independently of the starting point, the animals should turn to their own right or left. A rule change within the same strategy is defined as a reversal, while a rule change from one strategy to another is defined as a switch. The maze was cleaned with an odour-neutral solution every 10 trials to avoid odour-guided navigation. At least one behavioural switch (allocentric to egocentric or egocentric to allocentric rule learning) was successfully learnt during each session considered and further analysed. To define learning, a state-space model designed by Smith et al.³⁵ was used to analyse the outcome of the rat behaviour during the strategy switching task. Such a state-space model has been previously shown to be more reliable for performance prediction compared to other methods like the change-point test, moving average and the specified number of responses method³¹. The algorithm computes the probability and confidence intervals of the rat choosing the correct arm, taking into account the previous outcomes. The performance curve is defined as this probability. Thus, by using the lower confidence interval of the behavioural curve from the ongoing rule, three different behavioural phases were assigned to each trial (chance level is 0.5)¹: naive (below 0.1 for reversals and 0.3 for switches); learning (between 0.1- and 0.6 for reversals and 0.3-0.6 for switches); and learnt (over 0.6).

The data presented here correspond to behavioural controls and electrophysiological recordings. Behavioural controls were performed in two animals with a total of 48 rules tested (12 allocentric with landmarks, 11 allocentric without landmarks, 12 egocentric with landmarks and 13 egocentric without landmarks) in 17 sessions performed. The electrophysiological data corresponds to 26 sessions performed by 3 animals (8, 12 and 6 sessions, respectively). In total, 66 rules changes were achieved (median number of changes per session = 3). For each session, the rules were selected pseudo-randomly (trying to secure at least a switch between strategies and a change of rules within the same strategy). The median number of trials per session is 98.5 trials.

In vivo electrophysiology

A headstage (HS-132A, 2 × 32 channels, Axona Ltd) was used to pre-amplify the extracellular electric signals from the tetrodes. Output signals were amplified 1000 × via a 64-channel amplifier and then digitised with a sampling rate of 24 kHz at 16-bit resolution, using a 64-channel analogue-to-digital converter computer card (Axona Ltd). Single-unit offline detection was performed by thresholding the digitally filtered signal (0.8 – 5 kHz) over 5 standard deviations from the root mean square on 0.2 ms sliding windows. For each single-unit, 32 data points (1.33 ms) were sampled. A principal component analysis was implemented to extract the first three components of spike waveforms of each tetrode channel⁶⁴.

Spike waveforms from individual neurons were detected using the KlustaKwik automatic clustering software⁶⁵. Using the Klusters software⁶⁶, individual single units were isolated manually by verifying the waveform shape, waveform amplitude across tetrode’s channels, temporal autocorrelation (to assess the refractory period of a single-unit) and cross-correlation (to assess a common refractory period across single-units). The stability of single-units was confirmed by examining spike features over time.

Histology

At the end of recordings, animals were anaesthetised with urethane and micro-lesions were made at the tip of the tetrodes by using a 30 µA unipolar current for 10 s (Stimulus Isolator, World Precision Instruments). Rats were perfused using saline solution, followed by 20 min fixation with 4% paraformaldehyde, 15% (v/v) saturated picric acid and 0.05% glutaraldehyde in 0.1 M phosphate buffer. Serial coronal sections were cut at 70 µm with a vibratome (Leica). Sections containing a lesion were Nissl-stained to verify the position of the tetrodes.

Firing rates and correlations to performance

Firing rates were determined in each recording session and for each neuron by dividing the number of spikes over the corresponding time of the trial. Every trial started the moment that the rat initiated movement from the starting arm towards the end arm. The trial can be divided into 3 different behavioural segments (run, reward, inter-trial), plus another segment which consists on the entire duration of the trial. The run segment was defined as the period from the beginning of rat movement, at the starting arm, towards the reward arm, until the rat crossed a reward sensor located 10 cm at the terminal part of the rewarded area. The reward zone segment was defined as two seconds starting from the moment the animal crossed the reward sensor. The inter-trial segment was defined from the end of the reward segment until the beginning of the next run segment (it comprised a waiting period between 3 and 7 s inside of a bin, plus the time that takes to the animal to start moving and initiate the next trial). The entire trial spans from the start of a trial until the start of the next one. Therefore, in total, 4 firing rates were calculated by counting the spikes present during the specific segment and dividing it by the time spent in that segment. The average length (in seconds) for each segment (except Reward, which is always 2 seconds), specified as median, 25th and 75th percentile were: Run (2.86, 2.28, and 4.34), Inter-trial (19.16, 15.48, and 18.42) and the entire trial (24.3, 20.36, and 32.62).

Correlation of individual firing rates for each different segment and the entire trial were correlated to the observed performance of the animal using Spearman correlation. In addition, Bonferroni–Holm correction was used to correct for false positives. For the comparison made in Fig. 2e, shuffled data were generated. If given a matrix of firing rates FR_mxn, where ‘m’ are the trials and ‘n’ the number of neurons, together with a response vector (‘1’ is correct, ‘0’ is incorrect’) RV_mx1, shuffle was performed by rearranging the rows of both the firing rate matrix and the response vector, therefore disarranging the temporal organisation of the recording session but keeping the relation between firing rate and responses. The performance was recalculated over this new shuffled data, generating a shuffled performance curve, which was later correlated with the firing rates.

Neuronal state-space

Neuronal state-space was formed by defining each trial as a function of the z-scored firing rates of the neurons recorded during that session$T_m = {\mathrm{FR}}_{1m},{\mathrm{FR}}_{2m},...,{\mathrm{FR}}_{nm},$ where ${\mathrm{FR}}_{nm}$ is the firing rate of neuron ‘n’ in that specific trial ($T_m$). The vector of neuronal activity for a given trial was often referred in the text as a population vector. Only trials from the learning and learnt periods were taking into account.

Euclidean and Mahalanobis distance

Calculation of distance between two points was a recurrent analysis in the manuscript. Both Euclidean (1) and Mahalanobis (2) distances were used, ending up with similar results. Euclidean distance is defined as follows:

Given two trials (or centre of masses of rule clusters) T_a and T_b, defined by the firing rate (FR) of all neurons ‘n’ at that trial ${{T}}_a = ({\rm{FR}}_{1a}, {\rm{FR}}_{2a}, \ldots , {\rm{FR}}_{na})$ and ${{T}}_b = ({\rm{FR}}_{1b}, {\rm{FR}}_{2b}, \ldots , {\rm{FR}}_{nb})$ the Euclidean distance between the two trials is:

$${\mathrm{ED}}\left( {T_a,T_b} \right) = \sqrt {\mathop {\sum }\limits_{i = 1}^n \left( {T_{ia} - T_{ib}} \right)^2}$$

(1)

Mahalanobis distance was used mainly between a trial and a rule cluster; therefore, it is defined as follows:

Given a trial ${{T}}_a = \left( {{\rm{FR}}_{1a}, {\rm{FR}}_{2a}, \ldots , {\rm{FR}}_{na}} \right)$ and a rule cluster with mean ${\mathrm{RM}} = ({\rm{MFR}}_1, {\rm{MFR}}_2, \ldots , {\rm{MFR}}_n)$ where MFR_n is the mean firing rate of a cell (n) over all the trials conforming the rule cluster, the Mahalanobis distance between the trial and the rule cluster is:

$${\rm{MD}}\left( {T_a,\,{\rm{RM}}} \right) = \sqrt {\left( {T_a - {\rm{RM}}} \right)^\prime S^{ - 1}\left( {T_a - {\rm{RM}}} \right)},$$

(2)

where the operator (‘) is the transpose and S⁻¹ is the inverse of the covariance matrix.

Similarity index

The similarity index was defined as the normalised Euclidean distance between the coefficients of the quadratic equation modelling a trajectory. The higher the index, the more different are the trajectories.

Any given trajectory (TR) in a trial (m) can be represented with the coefficients of a quadratic equation:

$${\rm{TR}}_m = a_mx + b_mx + c_m$$

(3)

The normalised Euclidean distance (nED) between two trajectories is defined as:

$${\rm{nED}}\left( {{\rm{TR}}_1, {\rm{TR}}_2} \right)\sqrt {W_a\left( {a_2 - a_1} \right)^2 + W_b\left( {b_2 - b_1} \right)^2 + W_c\left( {c_2 - c_1} \right)^2},$$

(4)

where $W_a = \frac{1}{{{\rm{max}}\left( {a_m} \right) - {\mathrm{min}}(a_m)}}$, $W_b = \frac{1}{{{\rm{max}}\left( {b_m} \right) - {\mathrm{min}}(b_m)}}$, $W_c = \frac{1}{{{\rm{max}}\left( {c_m} \right) - {\mathrm{min}}(c_m)}}$

Clustering

Clustering was performed using the K-means clustering algorithm. This method requires the number of clusters to be given, and initialisation of the cluster centres (one per cluster to be found). Then, we computed the distance from each point to each centroid and assigned the point to the centroid from which the shortest distance is measured. Thus, a new centroid is calculated from the newly assigned points and the iteration continued until it found a balance (no change in the assignment of new points). For the data in Fig. 4c, we used unsupervised clustering based on K-means. We assigned each cluster one rule according to the majority of times this cluster represented a state corresponding to that rule. We compared the assigned rule values to the actual rule values and created an accuracy percentage by dividing the positively assigned rule values by the total number of points. We calculated the accuracy for both the observed and shuffled data.

Classifiers

A logistic regression and a support vector machine with linear kernel were used to create two classifiers which classify a trial (represented by the z-scored firing rate of all the neurons during that trial) to belonging to a given rule ‘A’. The following procedure was done in each recording session, which contained a rule repetition (‘A’ as the rule and ‘A°’ as the rule repetition): Classifiers were trained with a set consisting of random selection of 70% of the trials of rule A, labelled as ‘1’, and the rest of trials of the other rules labelled as ‘0’. Trials of the rule repetition ‘A°’ were not included in the training set. Classifiers were later tested by a data composed of the trials in the rule repetition ‘A°’ and the remaining 30% of trials of rule ‘A’, not used in the training set. Due to the random assignment of trials of rule ‘A’, to both training and test set, the procedure was repeated 1000 times. Figure 5g shows the classification of the test data as being part of rule A, divided by either the tested trials in ‘A’ or the trials in the repetition ‘A°’ (Accuracy of belonging to rule ‘A’).

All calculations were made in MATLAB (Mathworks, version R2009b and R2015b.) and statistical analyses were performed with MATLAB and Microsoft Excel. All the statistical tests used were non-parametric with Bonferroni–Holm correction unless stated differently in the text. The data used in linear regressions were checked for homoscedasticity. The principal component reduction was performed using a dimensionality reduction toolbox for Matlab⁶⁷.

Code availability

The computer code that supports the findings of this study is available from the corresponding authors upon reasonable request.

Data availability

The data that support the findings on this study are available from the corresponding authors upon reasonable request.

References

Rich, E. L. & Shapiro, M. Rat prefrontal cortical neurons selectively code strategy switches. J. Neurosci. 29, 7208–7219 (2009).
Article CAS PubMed PubMed Central Google Scholar
Durstewitz, D., Vittoz, N. M., Floresco, S. B. & Seamans, J. K. Abrupt transitions between prefrontal neural ensemble states accompany behavioral transitions during rule learning. Neuron 66, 438–448 (2010).
Article CAS PubMed Google Scholar
Karlsson, M. P., Tervo, D. G. & Karpova, A. Y. Network resets in medial prefrontal cortex mark the onset of behavioral uncertainty. Science 338, 135–139 (2012).
Article ADS CAS PubMed Google Scholar
Milner, B. Effects of different brain lesions on card sorting: the role of the frontal lobes. Arch Neurol. 9(1), 90–100 (1963).
Szczepanski, S. M. & Knight, R. T. Insights into human behavior from lesions to the prefrontal cortex. Neuron 83, 1002–1018 (2014).
Article CAS PubMed PubMed Central Google Scholar
Dias, R., Robbins, T. W. & Roberts, A. C. Dissociable forms of inhibitory control within prefrontal cortex with an analog of the Wisconsin Card Sort Test: restriction to novel situations and independence from “on-line” processing. J. Neurosci. 17, 9285–9297 (1997).
CAS PubMed Google Scholar
Buckley, M. J. et al Dissociable components of rule-guided behavior depend on distinct medial and prefrontal regions. Science 325, 52–58 (2009).
Article ADS CAS PubMed Google Scholar
Joel, D., Weiner, I. & Feldon, J. Electrolytic lesions of the medial prefrontal cortex in rats disrupt performance on an analog of the Wisconsin Card Sorting Test, but do not disrupt latent inhibition: implications for animal models of schizophrenia. Behav. Brain Res. 85, 187–201 (1997).
Article CAS PubMed Google Scholar
Uylings, H. B. M., Groenewegen, H. J. & Kolb, B. Do rats have a prefrontal cortex?. Behav. Brain Res. 146, 3–17 (2003).
Article PubMed Google Scholar
Rich, E. L. & Shapiro, M. L. Prelimbic/infralimbic inactivation impairs memory for multiple task switches, but not flexible selection of familiar tasks. J. Neurosci. 27, 4747–4755 (2007).
Article CAS PubMed Google Scholar
Miller, E. K. The prefrontal cortex and cognitive control. Nat. Rev. Neurosci. 1, 59–65 (2000).
Article CAS PubMed Google Scholar
Croxson, P. L. et al Quantitative investigation of connections of the prefrontal cortex in the human and macaque using probabilistic diffusion tractography. J. Neurosci. 25, 8854–8866 (2005).
Article CAS PubMed Google Scholar
Miller, E. K. & Cohen, J. D. An integrative theory of prefrontal cortex function. Annu. Rev. Neurosci. 24, 167–202 (2001).
Article CAS PubMed Google Scholar
Euston, D. R., Gruber, A. J. & McNaughton, B. L. The role of medial prefrontal cortex in memory and decision making. Neuron 76, 1057–1070 (2012).
Article CAS PubMed PubMed Central Google Scholar
Shidara, M. & Richmond, B. J. Anterior cingulate: single neuronal signals related to degree of reward expectancy. Science 296, 1709–1711 (2002).
Article ADS PubMed Google Scholar
Ito, S., Stuphorn, V., Brown, J. W. & Schall, J. D. Performance monitoring by the anterior cingulate cortex during saccade countermanding. Science 302, 120–122 (2003).
Article ADS CAS PubMed Google Scholar
Hyman, J. M., Holroyd, C. B. & Seamans, J. K. A novel neural prediction error found in anterior cingulate cortex ensembles. Neuron 95, 447–456 e443 (2017).
Article CAS PubMed Google Scholar
Funahashi, S., Bruce, C. J. & Goldman-Rakic, P. S. Mnemonic coding of visual space in the monkey’s dorsolateral prefrontal cortex. J. Neurophysiol. 61, 331–349 (1989).
Article CAS PubMed Google Scholar
Goldman-Rakic, P. S. Cellular basis of working memory. Neuron 14, 477–485 (1995).
Article CAS PubMed Google Scholar
Baeg, E. H. et al. Dynamics of population code for working memory in the prefrontal cortex. Neuron 40, 177–188 (2003).
Article CAS PubMed Google Scholar
Jones, M. W. & Wilson, M. A. Theta rhythms coordinate hippocampal-prefrontal interactions in a spatial memory task. PLoS Biol. 3, e402 (2005).
Article PubMed PubMed Central Google Scholar
Fujisawa, S. & Buzsáki, G. A 4 Hz oscillation adaptively synchronizes prefrontal, VTA, and hippocampal activities. Neuron 72, 153–165 (2011).
Article CAS PubMed PubMed Central Google Scholar
Ma, L., Skoblenick, K., Seamans, J. K. & Everling, S. Ketamine-Induced changes in the signal and noise of rule representation in working memory by lateral prefrontal neurons. J. Neurosci. 35, 11612–11622 (2015).
Article CAS PubMed Google Scholar
Kepecs, A., Uchida, N., Zariwala, H. A. & Mainen, Z. F. Neural correlates, computation and behavioural impact of decision confidence. Nature 455, 227–231 (2008).
Article ADS CAS PubMed Google Scholar
Kim, S., Hwang, J. & Lee, D. Prefrontal coding of temporally discounted values during intertemporal choice. Neuron 59, 161–172 (2008).
Article CAS PubMed PubMed Central Google Scholar
Benchenane, K. et al Coherent theta oscillations and reorganization of spike timing in the hippocampal- prefrontal network upon learning. Neuron 66, 921–936 (2010).
Article CAS PubMed Google Scholar
Sul, J. H., Kim, H., Huh, N., Lee, D. & Jung, M. W. Distinct roles of rodent orbitofrontal and medial prefrontal cortex in decision making. Neuron 66, 449–460 (2010).
Article CAS PubMed PubMed Central Google Scholar
Hosokawa, T., Kennerley, S. W., Sloan, J. & Wallis, J. D. Single-neuron mechanisms underlying cost-benefit analysis in frontal cortex. J. Neurosci. 33, 17385–17397 (2013).
Article CAS PubMed PubMed Central Google Scholar
Peters, Y. M., O’Donnell, P. & Carelli, R. M. Prefrontal cortical cell firing during maintenance, extinction, and reinstatement of goal-directed behavior for natural reward. Synapse 56, 74–83 (2005).
Article CAS PubMed Google Scholar
Amiez, C., Joseph, J. P. & Procyk, E. Reward encoding in the monkey anterior cingulate cortex. Cereb. Cortex 16, 1040–1055 (2006).
Article CAS PubMed Google Scholar
Asaad, W. F. & Eskandar, E. N. Encoding of both positive and negative reward prediction errors by neurons of the primate lateral prefrontal cortex and caudate nucleus. J. Neurosci. 31, 17772–17787 (2011).
Article CAS PubMed PubMed Central Google Scholar
Kvitsiani, D. et al Distinct behavioural and network correlates of two interneuron types in prefrontal cortex. Nature 498, 363–366 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Ma, L., Hyman, J. M., Phillips, A. G. & Seamans, J. K. Tracking progress toward a goal in corticostriatal ensembles. J. Neurosci. 34, 2244–2253 (2014).
Article CAS PubMed Google Scholar
Guise, K. G. & Shapiro, M. L. Medial prefrontal cortex reduces memory interference by modifying hippocampal encoding. Neuron 94, 183–192 e188 (2017).
Article CAS PubMed Google Scholar
Smith, A. C. et al Dynamic analysis of learning in behavioral experiments. J. Neurosci. 24, 447–461 (2004).
Article CAS PubMed Google Scholar
Lim, S. & Goldman, M. S. Balanced cortical microcircuitry for spatial working memory based on corrective feedback control. J. Neurosci. 34, 6790–6806 (2014).
Article CAS PubMed PubMed Central Google Scholar
Miller, P. & Wang, X. J. Inhibitory control by an integral feedback signal in prefrontal cortex: a model of discrimination between sequential stimuli. Proc. Natl Acad. Sci. Usa. 103, 201–206 (2006).
Article ADS CAS PubMed Google Scholar
Tanaka, S. Computational approaches to the architecture and operations of the prefrontal cortical circuit for working memory. Prog. Neuropsychopharmacol. Biol. Psychiatry 25, 259–281 (2001).
Article CAS PubMed Google Scholar
Hyman, J. M., Ma, L., Balaguer-Ballester, E., Durstewitz, D. & Seamans, J. K. Contextual encoding by ensembles of medial prefrontal cortex neurons. Proc. Natl Acad. Sci. USA 109, 5086–5091 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Euston, D. R. & McNaughton, B. L. Apparent encoding of sequential context in rat medial prefrontal cortex is accounted for by behavioral variability. J. Neurosci. 26, 13143–13155 (2006).
Article CAS PubMed Google Scholar
Fujisawa, S., Amarasingham, A., Harrison, M. T. & Buzsáki, G. Behavior-dependent short-term assembly dynamics in the medial prefrontal cortex. Nat. Neurosci. 11, 823–833 (2008).
Article CAS PubMed PubMed Central Google Scholar
Hollerman, J. R. & Schultz, W. Dopamine neurons report an error in the temporal prediction of reward during learning. Nat. Neurosci. 1, 304–309 (1998).
Article CAS PubMed Google Scholar
Narayanan, N. S. & Laubach, M. Neuronal correlates of post-error slowing in the rat dorsomedial prefrontal cortex. J. Neurophysiol. 100, 520–525 (2008).
Article PubMed PubMed Central Google Scholar
Rushworth, M. F. S. & Behrens, T. E. J. Choice, uncertainty and value in prefrontal and cingulate cortex. Nat. Neurosci. 11, 389–397 (2008).
Article CAS PubMed Google Scholar
Seamans, J. K., Floresco, S. B. & Phillips, A. G. D1 receptor modulation of hippocampal-prefrontal cortical circuits integrating spatial memory with executive functions in the rat. J. Neurosci. 18, 1613–1621 (1998).
CAS PubMed Google Scholar
Ma, L., Hyman, J. M., Lindsay, A. J., Phillips, A. G. & Seamans, J. K. Differences in the emergent coding properties of cortical and striatal ensembles. Nat. Neurosci. 17, 1100–1106 (2014).
Article CAS PubMed PubMed Central Google Scholar
Birrell, J. M. & Brown, V. J. Medial frontal cortex mediates perceptual attentional set shifting in the rat. J. Neurosci. 20, 4320–4324 (2000).
CAS PubMed Google Scholar
Bissonette, G. B. et al Double dissociation of the effects of medial and orbital prefrontal cortical lesions on attentional and affective shifts in mice. J. Neurosci. 28, 11124–11130 (2008).
Article CAS PubMed PubMed Central Google Scholar
Cho, K. K. et al Gamma rhythms link prefrontal interneuron dysfunction with cognitive inflexibility in Dlx5/6(+/-) mice. Neuron 85, 1332–1343 (2015).
Article CAS PubMed PubMed Central Google Scholar
Bissonette, G. B. & Roesch, M. R. Neural correlates of rules and conflict in medial prefrontal cortex during decision and feedback epochs. Front Behav. Neurosci. 9, 266 (2015).
Article PubMed PubMed Central Google Scholar
Rodgers, C. C. & DeWeese, M. R. Neural correlates of task switching in prefrontal cortex and primary auditory cortex in a novel stimulus selection task for rodents. Neuron 82, 1157–1170 (2014).
Article CAS PubMed Google Scholar
Cools, R. in Cognitive Search: Evolution, Algorithms, and the Brain: Strüngmann forum report. (eds Todd, P. M., Hills, T. T., & Robbins, T. W.) 111–124 (MIT Press, Cambridge, MA, 2012).
Rainer, G. & Miller, E. K. Effects of visual experience on the representation of objects in the prefrontal cortex. Neuron 27, 179–189 (2000).
Article CAS PubMed Google Scholar
Mante, V., Sussillo, D., Shenoy, K. V. & Newsome, W. T. Context-dependent computation by recurrent dynamics in prefrontal cortex. Nature 503, 78–84 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Mansouri, F. A., Tanaka, K. & Buckley, M. J. Conflict-induced behavioural adjustment: a clue to the executive functions of the prefrontal cortex. Nat. Rev. Neurosci. 10, 141–152 (2009).
Article CAS PubMed Google Scholar
Tanji, J. & Hoshi, E. Role of the lateral prefrontal cortex in executive behavioral control. Physiol. Rev. 88, 37–57 (2008).
Article PubMed Google Scholar
Ma, L., Hyman, J. M., Durstewitz, D., Phillips, A. G. & Seamans, J. K. A quantitative analysis of context-dependent remapping of medial frontal cortex neurons and ensembles. J. Neurosci. 36, 8258–8272 (2016).
Article CAS PubMed Google Scholar
Hoover, W. B. & Vertes, R. P. Anatomical analysis of afferent projections to the medial prefrontal cortex in the rat. Brain Struct. Funct. 212, 149–179 (2007).
Article PubMed Google Scholar
Jaeger, H., Lukosevicius, M., Popovici, D. & Siewert, U. Optimization and applications of echo state networks with leaky-integrator neurons. Neural Netw. 20, 335–352 (2007).
Article PubMed MATH Google Scholar
Maass, W., Natschlager, T. & Markram, H. Real-time computing without stable states: a new framework for neural computation based on perturbations. Neural Comput. 14, 2531–2560 (2002).
Article PubMed MATH Google Scholar
Barak, O. & Tsodyks, M. Working models of working memory. Curr. Opin. Neurobiol. 25, 20–24 (2014).
Article CAS PubMed Google Scholar
Bernacchia, A., Seo, H., Lee, D. & Wang, X.-J. A reservoir of time constants for memory traces in cortical neurons. Nat. Neurosci. 14, 366–372 (2011).
Article CAS PubMed PubMed Central Google Scholar
Paxinos, G. & Watson, C. The Rat Brain in Stereotaxic Coordinates 6th Edn, 547612–547612 (Academic Press, Australia, 2006).
Csicsvari, J., Hirase, H., Czurko, A. & Buzsaki, G. Reliability and state dependence of pyramidal cell-interneuron synapses in the hippocampus: an ensemble approach in the behaving rat. Neuron 21, 179–189 (1998).
Article CAS PubMed Google Scholar
Kadir, S. N., Goodman, D. F. & Harris, K. D. High-dimensional cluster analysis with the masked EM algorithm. Neural Comput. 26, 2379–2394 (2014).
Article PubMed PubMed Central Google Scholar
Hazan, L., Zugaro, M. & Buzsáki, G. Klusters, NeuroScope, NDManager: A free software suite for neurophysiological data processing and visualization. J. Neurosci. Methods 155, 207–216 (2006).
Article PubMed Google Scholar
Van Der Maaten, L. J. P., Postma, E. O. & Van Den Herik, H. J. Dimensionality reduction: a comparative review. J. Mach. Learn Res. 10, 1–41 (2009).
Google Scholar

Download references

Acknowledgements

This work was supported by the Vienna Science and Technology Fund (WWTF), project LS14-095. We thank E.Borok and R.Hauer for excellent technical support; F.Stella, B.Lasztoczi and T.Oezdemir for commenting on a previous version of the manuscript and M.Lagler for comments on the analysis.

Author information

Stephane Ciocchi
Present address: Department of Physiology, University of Bern, 3012, Bern, Switzerland

Authors and Affiliations

Center for Brain Research, Divison of Cognitive Neurobiology, Medical University of Vienna, 1090, Vienna, Austria
Hugo Malagon-Vina, Stephane Ciocchi, Johannes Passecker & Thomas Klausberger
Center for Medical Statistics, Section for Artificial Intelligence, Informatics and Intelligent Systems, Medical University of Vienna, 1090, Vienna, Austria
Georg Dorffner

Authors

Hugo Malagon-Vina
View author publications
You can also search for this author in PubMed Google Scholar
Stephane Ciocchi
View author publications
You can also search for this author in PubMed Google Scholar
Johannes Passecker
View author publications
You can also search for this author in PubMed Google Scholar
Georg Dorffner
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Klausberger
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.M.-V., S.C., J.P., G.D. and T.K. contributed to experiments, data analysis, and preparation of the manuscript.

Corresponding authors

Correspondence to Hugo Malagon-Vina or Thomas Klausberger.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Malagon-Vina, H., Ciocchi, S., Passecker, J. et al. Fluid network dynamics in the prefrontal cortex during multiple strategy switching. Nat Commun 9, 309 (2018). https://doi.org/10.1038/s41467-017-02764-x

Download citation

Received: 01 June 2017
Accepted: 26 December 2017
Published: 22 January 2018
DOI: https://doi.org/10.1038/s41467-017-02764-x

This article is cited by

A persistent prefrontal reference frame across time and task rules
- Hannah Muysers
- Hung-Ling Chen
- Marlene Bartos
Nature Communications (2024)
Characteristics of the Neuronal Support for Operative Behavior Formed by Mono- and Multistep Methods
- E. A. Kuzina
- Yu. I. Aleksandrov
Neuroscience and Behavioral Physiology (2020)
MUW researcher of the month
- Hugo Malagon-Vina
Wiener klinische Wochenschrift (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.