Introduction

Building a bridge between the structure and function of neural networks is an ambition at the heart of neuroscience. Historically, the first models studied were simplistic artificial neurons arranged in a feed-forward architecture. Such models are still widely applied today—forming the conceptual basis for Deep Learning. They have shaped our intuition of neurons as “feature detectors” which fire when a certain approximate configuration of input signals is present, and which aggregate simple features into more and more complex ones layer by layer. Yet in the brain, the vast majority of neural connections are recurrent, and although several possible explanations of their function have been proposed1,2,3, their computational purpose is still little understood4.

In the present paper, we propose a new algorithmic role which recurrent neural connections might play, namely as a computational substrate to solve graph traversal problems. We argue that many cognitive tasks like navigation or motion planning can be framed as finding a path from a starting position to some target position in a space of possible states. The possible states may be encoded by neurons via their “feature-detector property”. Allowed transitions between nearby states would then be encoded in recurrent connections, which can form naturally via Hebbian learning since the feature detectors’ receptive fields overlap. They may eventually form a “map” of some external system. Activation propagating through the network can then be used to find a short path through this map. In effect, the neural dynamics then implement an algorithm similar to Breadth-First Search on a graph.

Proposed model

A network of neurons that represents a manifold of stimuli

We consider a neural network which is exposed to some external stimuli-generating process under the assumption that the possible stimuli can be organized in some continuous manifold in the sense that similar stimuli are located close to each other on this manifold. For example, in the case of a mouse running through a maze, all possible perceptions can be associated with a particular position in a two-dimensional map, and neighboring positions will generate similar perceptions, see Fig. 1a.

Proprioception, i. e. the sense of location of body parts, can also be a source of stimuli. For example, for a simplified arm with two degrees of freedom, every possible position of the arm corresponds to one specific stimulus, cf. Fig. 1b. All possible stimuli combined give rise to a two-dimensional manifold. The example also shows that the manifold will usually be restricted since not every conceivable combination of the two joint angles corresponds to a physically viable position of the arm.

The manifold of potential stimuli need not be embedded in a flat Euclidean space as in the case of the maze. For example, if the stimuli are two-dimensional figures which can be shifted horizontally or rotated on a screen, the corresponding manifold is two-dimensional (one translational parameter plus one for the rotation angle), but it is not topologically equivalent to a flat plane since a change of the rotation angle by \(2\pi\) maps the figure onto itself again, see Fig. 1c.

We assume that such manifolds of stimuli are approximated by the connectivity structure of a neural network which forms via a learning process. The result is a neural structure which we call a cognitive map. The defining property of a cognitive map is that it has a neural encoding for every possible stimulus and that two similar stimuli, i. e. stimuli which are close to each other in the manifold of stimuli, are represented by similar encodings, i. e. encodings which are close to each other in the cognitive map. (Of course, we do not imply that two neurons which are close to each other in the connectivity structure are also close to each other with respect to their physical location in the neural tissue.)

Figure 1

Three examples of stimuli-generating processes and recurrent neural networks representing the corresponding manifold of stimuli. (a) Approximate positions in the maze are encoded in single neurons. The planning problem is to find a way through the maze given the current position of the cheese and the mouse. (b) Approximate positions of the "arm" are encoded in single neurons. Physically impossible positions are not encoded at all giving rise to the gap in the center of the cognitive map. An example planning problem is to move the "hand" from behind the body to a position in front of the body without collision. (c) The visual stimulus is always the letter "A", but at different x-positions and tilted at different angles α. An example planning problem in this case is the decision whether the "A" has to be moved/tilted to the left or to the right to convert it from some given position to another one.

For the model, we make a very simplistic choice and assume a single-neuron encoding, i. e. the manifold of stimuli is covered by the receptive fields of individual neurons. Each such receptive field is a small localized area in the manifold and two neighboring receptive fields may overlap, see Fig. 2. Such an encoding is a typical outcome for a single layer of neurons which are trained in a competitive Hebbian learning process5.

The key idea of the model is that a problem which can be formulated as a planning problem in the manifold of stimuli can be solved as a planning problem in the corresponding cognitive map. To this end, it is not enough to consider the cognitive map as a set of individual points; its topology must be known as well. This topological information is encoded in the recurrent connections of the neural network.

It seems natural that a neural network could learn this topology via Hebbian learning: Two neurons with close-by receptive fields in the manifold will be excited simultaneously relatively often because their receptive fields overlap. Consequently, recurrent connections within the cognitive map will be strengthened between such neurons, and the topology of the neural network will approximate the topology of the manifold, see Fig. 2. This idea has been explored in more detail by Curto and Itskov in6. Indeed, previous work on the formation of neocortical maps that code for ocular dominance and stimulus orientation suggests that the formation of cognitive maps could well occur in this fashion7. For a review and comparison of these kinds of cognitive maps see8. Recent studies also show that recurrent neural networks might serve even more purposes, for example for working memory9,10 or image recognition11.

Figure 2

In the model, the recurrent connections within a single layer of neurons approximate the topology of the manifold of stimuli. During the learning process, the strongest recurrent connections are formed between neurons with overlapping receptive fields. The problem of finding a route through the manifold (red line) is thus approximated by the problem of finding a path through the graph of recurrent neural connections (red path).

To avoid confusion with related concepts in machine learning, note that the present definition of recurrence is not exactly the same as the one used, for example, in Long Short-Term Memory networks12. Those algorithms employ recurrent connections as a loop to mix some input signal of a neural network with the output signal from a previous time step. The present model, however, distinguishes between the primary excitation by some external stimulus via feed-forward connections and the resulting dynamics of the network mediated by the recurrent connections, as described in the following.

Dynamics required for solving planning problems

Having set up a network that represents a manifold of stimuli, we need to endow this network of feed-forward and recurrent connections with dynamics. We do so by imposing two interacting mechanisms.

First, the neurons in the network should exhibit continuous attractor dynamics13: If a “clique” of a few tightly connected neurons is activated by a stimulus via the corresponding feed-forward pass, its members keep activating each other while inhibiting their wider neighborhood. The result is a self-sustained, localized neural activity surrounded by a “trench of inhibition”. In the model, this encodes the as-is situation or the starting position for the planning problem. Such a state is called an “attractor” since it is stable under small perturbations of the dynamics, and it is part of a continuous landscape of attractors with different locations across the network. The dynamics of such bumps of activity in neural sheets of different kinds have been studied in depth in14 and applied to more general problems in neuroscience15, but they have not, as of yet, been used as a means to solve planning problems in the way proposed here.

Second, the neural network should allow for wave-like expansion of activity. If a small number of close-by neurons are activated by some hypothetical executive brain function (i. e. not via the feed-forward pass), they activate their neighbors, which in turn activate theirs, and so on. The result is a wave-like front of activity propagating through the recurrent network. The neurons which have been activated first encode the to-be state or the end position of the planning problem.

The key to solving a planning problem is in the interaction between the two types of dynamics, namely in what happens when the expanding wave front hits the stationary peak of activity. On the side from which the wave approaches, the “trench of inhibition” surrounding the peak is in part neutralized by the additional excitatory activation from the wave. Consequently, the containment of the activity peak is somewhat “softer” on the side where the wave hits it, and the peak may move a step towards the direction of the incoming wave. This process repeats, leading to a small change of position with every incoming wave front. The localized peak of excitation will follow the wave fronts back to their source, thus moving along a route through the manifold from start to end position, see Fig. 3.

Figure 3

The as-is state of the system is encoded in a stable, localized, and self-sustained peak of activity surrounded by a “trench” of inhibition (top left corner). A planning process is started by stimulating the neurons which encode the to-be position (bottom right corner). The resulting waves of activity travel through the network and interact with the localized peak. Each incoming wave front shifts the peak slightly towards its direction of origin. Note that, for reasons of simplicity, we did not draw the neural network in this figure but only the manifold which it approximates.

The two types of dynamics described above are seemingly contradictory, since the first one restricts the system to localized activity, while the second one permits a wave-like propagation of activity throughout the system. To resolve the conflict in numerical simulations, we have split the dynamics into a continuous attractor layer and a wave propagation layer, which are responsible for different aspects of the system’s dynamical behaviour. We discuss the concepts of a numerical implementation in the section “Implementation in a numerical proof-of-concept” and ideas for a biologically more plausible implementation in the “Discussion” section.

Connection to real-life cognitive processes

To make the proposed concept more tangible, we present a rough sketch of how it could be embedded in a real-life cognitive process along with a speculative proposal for its anatomical implementation in the special case of motor control.

As an example, we consider a human grabbing a cup of coffee and we explain how the presented model complements and details the processes described in16 for that particular case. According to our hypothesis, the as-is position of the subject’s arm is encoded as a localized peak of activity in the cognitive map encoding the complex manifold of arm positions. Anatomically, this cognitive map is certainly of a more complicated structure than the one in our simple model and it is possibly shared between primary motor cortex and primary somatosensory cortex.

We assume that the encoding of the arm’s state works in a bi-directional way, somewhat like the string of a puppet: When the arm is moved by external forces, the neural representation of its position mediated by afferent somatosensory signals moves along with it. On the other hand, if the representation in the cortical map is changed slightly by some cognitive process, then some hypothetical control mechanism of the primary motor cortex sends efferent signals to the muscles in an attempt to make the arm follow its neural representation and bring the limb and its representation back into congruence.

If now the human subject decides to grab the cup of coffee, some executive brain function with heavy involvement from prefrontal cortex constructs a to-be state of holding the cup: The final position of the hand with the fingers around the cup handle is what the person consciously thinks of. The high-level instructions generated by prefrontal cortex are possibly translated by the premotor cortex into a specific target state in the cognitive map that represents the manifold of possible arm positions. The neurons of the primary motor cortex and/or the primary somatosensory cortex representing this target state are thus activated.

The activation creates waves of activity propagating through the network, reaching the representation of the as-is state and shifting it slightly towards the to-be state. The hypothetical muscle control mechanism reacts to this disturbance and performs a motor action to keep the physical position of the arm and its representation in the cognitive map in line. As long as the person implicitly represents the to-be state, the arm “automatically” performs the complicated sequence of many individual joint movements which is necessary to grab the cup.

This concept can be extended to flexibly consider restrictions that have not been hard-coded in the cognitive map by learning. For example, in order to grab the cup of coffee, the arm may need to avoid obstacles on the way. To this end, the hypothetical executive brain function which defines the target state of the hand could also temporarily “block” certain regions of the cognitive map (e. g. via inhibition) which it associates with the discomfort of a collision. Those parts of the network which are blocked cannot conduct the “planning waves” anymore and thus a path around those regions will be found.

Implementation in a numerical proof-of-concept

To substantiate the presented conceptual ideas, we performed numerical experiments using multiple different setups. In each case, the implementation of the model employs two neural networks that both represent the same manifold of stimuli.

The continuous attractor layer is a sheet of neurons that models the functionality of a network of place cells in the human hippocampus17,18. Each neuron is implemented as a rate-coded cell embedded in its neighborhood via short-range excitatory and long-range inhibitory connections as in19. This structure allows the formation of a self-sustaining “bump” of activity, which can be shifted through the network by external perturbations.

The wave propagation layer is constructed with an identical number of excitatory and inhibitory Izhikevich neurons20,21, properly connected to allow for stable signal propagation across the manifold of stimuli. The target node is permanently stimulated, causing it to emit waves of activation which travel through the network.

The interaction between the two layers is modeled in a rather simplistic way. As in19, a time-dependent direction vector was introduced in the synaptic weight matrix of the continuous attractor layer. It has the effect of shifting the synaptic weights in a particular direction which in turn causes the location of the activation bump in the attractor layer to shift to a neighbouring neuron. The direction vector is updated whenever a wave of activity in the wave propagation layer newly enters the region which corresponds to the bump in the continuous attractor layer. Its direction is set to point from the center of the bump to the center of the overlap area between bump and wave, thus causing a shift of the bump towards the incoming wave fronts.

For more details on the implementation, see “Methods and experiments” below.

Results of the numerical experiments

In a very simple initial configuration, the path-finding algorithm was tested on a fully populated square grid of neurons as described before. Figure 4 shows snapshots of wave activity and continuous attractor position at some representative time points during the simulation. As expected, stimulation of the wave propagation layer in the lower right of the cognitive map causes the emission of waves, which in turn shift the bump in the continuous attractor layer from its starting position in the upper left towards its target state.

Figure 4

Activity in the wave propagation layer (greyish lines) and the continuous attractor layer (circular blob-like structure) overlaid on top of each other at different time points during the simulation. The grid signifies the neural network structure, i. e. every grid cell in the visualization corresponds to one neuron each in the wave propagation layer and the continuous attractor layer. The position of the external wave propagation layer stimulation (to-be state) is shown with an arrow. Starting from an initial position in the top left of the sheet, the activation bump traces the incoming waves back to their source in the bottom right.

As described in the section “Connection to real-life cognitive processes” above, the manifold of stimuli represented by the neural network can be curved, branched, or of different topology, either permanently or temporarily. The purpose of the model is to allow for a reliable solution to the underlying graph traversal problems independent of potential obstacles in the networks. For this reason, we investigated whether the bump of activation in the continuous attractor layer was able to successfully navigate through the graph from the start node to the end node in the presence of nodes that could not be traversed. To test this idea, we constructed different “mazes”, blocking off sections of the graph by zeroing the synaptic connections of the respective neurons in the wave propagation layer and by clamping the activations of the corresponding neurons in the continuous attractor layer to zero, see Fig. 5. We found that in all these setups, the algorithm was able to successfully navigate the bump in the continuous attractor layer through the mazes.

Figure 5

Simulations where specific portions of the neural layers were blocked for traversal (dark hatched regions) show the model’s capability of solving complex planning problems. Note that, especially in the very fine structure of Fig. 5c, leftover excitation can trigger apparently spontaneous waves in the simulation region, such as at the right center at \(t={83}\,\hbox {ms}\). As the corresponding neurons are not constantly stimulated, these are usually singular events that do not disturb the overall process (Supplementary Videos).

Relation to existing graph traversal algorithms

To conclude this section, we highlight a few parallels between the presented approach and the classical Breadth-First Search (BFS) algorithm.

BFS begins at some start node \(s\) of the graph and marks this node as “visited”. In each step, it then chooses one node which is “visited” but not “finished” and checks whether there are still unvisited nodes that have an edge to this node. If so, the corresponding nodes are also marked as “visited”, the current node is marked as “finished”, and another iteration of the algorithm is started.

The approach presented here is a parallelized variant of this algorithm, as the sketch below illustrates. Assuming that all neurons always obtain sufficient current to become activated, the propagating wave corresponds to the step of the algorithm in which the neighbors of the currently considered node are investigated. In contrast to classical BFS, the model performs this expansion for all candidate nodes simultaneously: it considers all nodes currently marked as visited, checks the neighbors of all these nodes at once, and marks them as visited if necessary.
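For illustration, the following minimal Python sketch (our own toy example, not taken from the published implementation74) contrasts this parallelized, wavefront-style expansion with classical BFS; the graph and all names are hypothetical:

```python
def wavefront_bfs(adj, target):
    """Parallelized BFS: expand the whole frontier at once, mimicking one
    wave front per iteration. Returns predecessor pointers toward `target`."""
    visited = {target}
    predecessor = {target: None}
    frontier = [target]                  # the current wave front
    while frontier:
        next_frontier = []
        for node in frontier:            # in the neural model, these expansions
            for neighbor in adj[node]:   # happen simultaneously in one wave step
                if neighbor not in visited:
                    visited.add(neighbor)
                    predecessor[neighbor] = node
                    next_frontier.append(neighbor)
        frontier = next_frontier         # next wave front
    return predecessor

# The bump follows the wave fronts back to their source: starting from the
# as-is node s, repeatedly stepping to the predecessor reaches the target t.
adj = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [0, 2]}  # a 4-cycle as toy graph
pred, path, node = wavefront_bfs(adj, target=2), [], 0
while node is not None:
    path.append(node)
    node = pred[node]
print(path)  # [0, 1, 2]: a shortest path from s=0 to t=2
```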

Having all ingredients of the proposed conceptual framework in place, the following section reviews some experimental evidence indicating that it could in principle be employed by biological brains.

Empirical evidence

Cognitive maps

The concept of “cognitive maps” was first proposed by Edward Tolman22,23, who conducted experiments to understand how rats were able to navigate mazes to seek rewards.

A body of evidence suggests that neural structures in the hippocampus and entorhinal cortex potentially support cognitive maps used for spatial navigation17,24,25. Within these networks, specific kinds of neurons are thought to be responsible for the representation of particular aspects of cognitive maps. Some examples are place cells17,24 which code for the current location of a subject in space, grid cells which contribute to the problem of locating the subject in that space26 as well as supporting the stabilisation of the attractor dynamics of the place cell network19, head-direction cells27 which code for the direction in which the subject’s head is currently facing, and reward cells28 which code for the location of a reward in the same environment.

The brain regions supporting spatially aligned cognitive maps might also be utilized in the representation of cognitive maps in non-spatial domains: In29, fMRI recordings taken from participants while they performed a navigation task in a non-spatial domain showed that similar regions of the brain were active for this task as for the task outlined in30 where participants navigated a virtual space using a VR apparatus. Further, according to31, activation of neurons in the hippocampus (one of the principal sites for place cells) is indicative of how well participants were able to perform in a task related to pairing words. Supporting this observation with respect to the role played by these brain regions in the operation of abstract cognitive maps, the authors of32 found that lesions to the hippocampus significantly impaired performance on a task of associating pairs of odors by how similar they smelled. Finally, complementing these findings, rat studies have shown that hippocampal cells can code for components in navigation tasks in auditory33,34, olfactory35, and visual36 task spaces.

Feed-forward and recurrent connections

As described in the section “A network of neurons that represents a manifold of stimuli”, the proposed model is built around a particular theme of connectivity: Each neuron represents a certain pattern in sensory perception mediated via feed-forward connections. In addition, recurrent connections between two neurons strengthen whenever they are activated simultaneously. In the following, we give an overview of some relevant experimental observations which are consistent with this mode of connectivity.

The most prominent example of neurons which are often interpreted as pattern detectors is provided by the cells in primary visual cortex. These neurons fire when a certain pattern is perceived at a particular position and orientation in the visual field. On the one hand, these neurons receive their feed-forward input from the lateral geniculate nucleus. On the other hand, they are connected to each other through a tight network of recurrent connections. Several studies (see e. g.37,38,39) have shown that two such cells are preferentially connected when their receptive fields are co-oriented and co-axially aligned. Due to the statistical properties of natural images, where elongated edges appear frequently, two such cells can also be expected to be positively correlated in their firing due to feed-forward activation.

The somatosensory cortex is another brain region where several empirical findings are in line with the postulated theme of connectivity. Experiments on non-human primates suggest that “3b neurons act as local spatiotemporal filters that are maximally excited by the presence of particular stimulus features”40.

Regarding the recurrent connections in somatosensory cortex, some empirical support stems from the well-studied rodent barrel cortex. Here, the animal’s facial whiskers are represented somatotopically by the columns of primary somatosensory cortex. Neighboring columns of the barrel cortex are connected via a dense network of recurrent connections. Sensory deprivation studies indicate that the formation of these connections depends on the feed-forward activation of the respective columns: If the whiskers corresponding to one of the columns are trimmed during early post-natal development, the density of recurrent connections with this column is reduced41,42. Conversely, synchronous co-activation over the course of a few hours can lead to increased functional connectivity in the primary somatosensory cortex43.

The primary somatosensory cortex also receives proprioceptive signals from the body which represent individual joint angles. Taken as a whole, these signals characterize the current posture of the animal and there is an obvious analogy to the arm example, cf. Fig. 1b. We are not aware of any experimental results regarding the recurrent connections between proprioception detectors, but it seems reasonable to expect that the results about processing of tactile input in the somatosensory cortex can be extrapolated to the case of proprioception. This would imply that a recurrent network structure roughly similar to Fig. 1b should emerge and thus support the model for controlling the arm.

Area 3a of the somatosensory cortex, whose neurons exhibit primarily proprioceptive responses, is also densely connected to the primary motor cortex. It contains many corticomotoneuronal cells which drive motoneurons of the hand in the spinal cord44. This tight integration between sensory processing and motor control might be a hint that the hypothetical string-of-a-puppet muscle control mechanism from the section on the “Connection to real-life cognitive processes” is not too far from reality.

In summary, evidence from primary sensory cortical areas seems to suggest a common cortical theme of connectivity in which neurons are tuned to specific patterns in their feed-forward input from other brain regions, while being connected intracortically based on statistical correlations between these patterns.

Wave phenomena in neural tissue

There is a large amount of empirical evidence for different types of wave-like phenomena in neural tissue. We summarize some of the experimental findings, focusing on fast waves (a few tens of \(\hbox {cm}\,\hbox {s}^{-1}\)). These waves are suspected to have some unknown computational purpose in the brain45, and they seem to bear the most resemblance to the waves postulated in the model.

Using multielectrode local field potential recordings, voltage-sensitive dye, and multiunit measurements, traveling cortical waves have been observed in several brain areas, including motor cortex, visual cortex, and non-visual sensory cortices of different species. There is evidence for wave-like propagation of activity both in sub-threshold potentials and in the spatiotemporal firing patterns of spiking neurons46.

In the motor cortex of awake, behaving monkeys, Rubino et al.47 observed wave-like propagation of local field potentials. They found correlations between some properties of these wave patterns and the location of the visual target to be reached in the motor task. On the level of individual neurons, Takahashi et al. found a “spatiotemporal spike patterning that closely matches propagating wave activity as measured by LFPs in terms of both its spatial anisotropy and its transmission velocity”48.

In the visual cortex, a localized visual stimulus elicits traveling waves which traverse the field of vision. For example, Muller et al. have observed such waves rather directly in single-trial voltage-sensitive dye imaging data measured from awake, behaving monkeys49.

Spatial navigation using place cells

Finding a short path through a maze-like environment, cf. Fig. 1a, is one of the planning problems the model is capable of solving. In this case, each neuron of the continuous attractor layer represents a “place cell” which encodes a particular location in the maze.

Place cells were discovered by John O’Keefe and Jonathan Dostrovsky in 1971 in the hippocampus of rats17. They are pyramidal cells that are active when an animal is located in a certain area (“place field”) of the environment. Place cells are thought to use a mixture of external sensory information and stabilizing internal dynamics to organize their activity: On the one hand, they integrate external environmental cues from different sensory modalities to anchor their activity to the real world. This is evidenced by the fact that their activity is affected by changes in the environment and that it is stable under removal of a subset of cues50,51. On the other hand, firing patterns are then stabilized and maintained by internal network dynamics, as cells remain active under conditions of total sensory deprivation52. Collectively, the place cells are thought to form a cognitive map of the animal’s environment.

Targeted motion caused by localized neuron stimulation

In 2002, Graziano et al. reported results from electrical microstimulation experiments in the primary motor and premotor cortex of monkeys53. Stimulation of different sites in the cortical tissue for a duration of 500 ms resulted in complex body motions involving many individual muscle commands. The stimulation of one particular site typically led to smooth movements with a certain end state, independent of the initial posture of the monkey, while stimulating a different location in the cortical tissue led to a different end state. In terms of the model presented here, this would be explained by two wave fronts propagating in opposite directions away from the to-be location, only one of which hits the localized peak of activity encoding the as-is location and pulls it closer to the to-be state. Graziano et al. also reported that the motions stopped as soon as the electrical stimulus was turned off. This is fully consistent with our model, where stopping the to-be activation means that no more wave fronts are created and thus the as-is peak of activity remains where it is.

After this original discovery by Graziano et al. in 2002, several additional studies have confirmed and extended their results, see54 for an overview. The neural structures which cause the bodily motions towards a specific target state have been named ethological maps or action maps54.

Furthermore, several studies suggest that such action maps are shaped by experience: Restricting limb movements for thirty days in a rat can cause the action map to deteriorate. A recovery of the map is observed during the weeks after freeing the restrained limb55. Conversely, a reversible local deactivation of neural activity in the action map can temporarily disable a grasping action in rats56. A permanent lesion in the cortical tissue can disable an action permanently. The animal can re-learn the action, though, and the cortical tissue reorganizes to represent the newly re-learnt action at a different site57. These observed plasticity phenomena are fully in line with our model which emphasises a self-organized formation of the cognitive map via Hebbian processes both for the feature learning and for the construction of the recurrent connections.

Participation of the primary sensory cortex in non-sensory tasks

For the first two examples in Fig. 1, the association with a planning task is obvious. Our third example, the geometric transformations of the letter “A”, may appear a bit more surprising, though: After all, the neural structures in visual sensory cortex would then be involved in “planning tasks”. The tissue of at least V1 fits the previously explained theme of connectivity, but it is often thought of as a pure perception mechanism which aggregates optical features in the field of vision and thus performs some kind of preprocessing for the higher cortical areas.

However, there is evidence that the visual sensory cortex plays a much more active role in cognition than pure feature detection on the incoming stream of visual sensory information. In particular, the visual cortex is active in visual imagery, that is, when a subject with closed eyes mentally imagines a visual stimulus58.

Based on such findings, it has been suggested that “the visual cortex is something akin to a ‘representational blackboard’ that can form representations from either the bottom-up or top-down inputs”58. In our model, we take this line of thinking one step further and speculate that the early visual cortex does not only represent visual features, but that it also encodes possible transformations like rotation, scaling or translation via its recurrent connections. In this view, the “blackboard” becomes more of a “magnetic board” on which mental images can be placed and shifted around according to rules which have been learned by experience.

Of course, despite the over-simplifying Fig. 1c, we do not intend to imply that there are any neurons in the visual cortex with a complex pattern like the whole letter “A” as a receptive field. In reality, we would expect the letter to be represented in early visual cortex as a spatio-temporal multi-neuron activity pattern. The current version of our model, on the other hand, allows for single-neuron encoding only and thus reserves one neuron for each possible position of the letter. We will discuss this and other limitations of the proposed model in the “Discussion” section.

Temporal dynamics

The concept presented in this article implies predictions about the temporal dynamics of cognitive planning processes which can be compared to experiments: The bump of activity only starts moving when the first wave front arrives. Assuming that every wave front has a similar effect on the bump, its speed of movement should be proportional to the frequency with which waves are emitted. Thus both the time until movement onset and the duration of the whole planning process should be proportional to the length of the traversed path in the cortical map. Increased frequency of wave emission should accelerate the process.
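These relations can be summarized in a back-of-the-envelope form. Writing \(L\) for the length of the traversed path in the cortical map, \(v_{\mathrm {wave}}\) for the wave propagation speed, \(f\) for the wave emission frequency, and \(\delta\) for the bump displacement per incoming wave front (symbols introduced here for illustration only), the model predicts approximately

$$\begin{aligned} t_{\mathrm {onset}} \approx \frac{L}{v_{\mathrm {wave}}}\,,\qquad v_{\mathrm {bump}} \approx f\,\delta \,,\qquad t_{\mathrm {total}} \approx \frac{L}{v_{\mathrm {wave}}} + \frac{L}{f\,\delta }\,. \end{aligned}$$

Both contributions grow linearly with \(L\), and the second one shrinks as the emission frequency \(f\) increases.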

One supporting piece of evidence is provided by mental imagery: Experiments in the 1970s59,60 have triggered a series of studies on mental rotation tasks, where the time to compare a rotated object with a template has often been found to increase proportionally with the angle of rotation required to align the two objects.

In the case of bodily motions, the total time to complete the cognitive task is not a well suited measure since it strongly depends on mechanical properties of the limbs. Yet for electrical stimulation of the motor cortex (cf. “Targeted motion caused by localized neuron stimulation” section) Graziano et al. report that the speed of evoked arm movements increases with stimulation frequency61. Assuming that this frequency determines the rate at which the hypothetical waves of activation are emitted, this is consistent with our model.

In addition, our model makes the specific prediction that the latency between stimulation and the onset of muscle activation should increase with the distance between initial and target posture. The reason is that the very first wave front needs to travel through the cognitive map before the bump of activation starts being shifted, and only then muscular activation can be triggered by the bump’s deflection. The travel time of this wave front thus becomes an additive component of the total latency and it can be expected to be roughly proportional to the distance between initial and target posture as measured in the metric of the cognitive map. We are not aware of any studies having examined this particular relationship yet.

Discussion

The model proposed here is, to the best of our knowledge, the first model that allows for solving graph problems in a biologically plausible way such that the solution (i. e. the specific path) can be calculated directly on the neural network as the only computational substrate.

Similar approaches and models have been investigated earlier, especially in the field of neuromorphic computing. For example, in62,63,64,65,66 graphs are modeled using neurons and synapses, and computations are performed by exciting specific neurons, which induces propagation of activity through the graph, and by observing the resulting spiking behavior. Models using two or more cell layers of spiking neurons have also been used for unsupervised learning of orientation, disparity, and motion representations67 or for modeling the tactile processing pathway68. In addition, recurrent neural networks were recently also used to model and analyze working memory9,10 or image recognition tasks11. These models, however, are either designed for very specific tasks68, do not guarantee a stable performance11, or lack biological plausibility9,10,67. Furthermore,69 describes another neural computation mechanism, based on circuit models of spiking neurons, which “might be a general computational mechanism of cortical circuits”69. This mechanism was developed for understanding how spontaneous activity is involved in visual processing and has not been investigated in terms of its applicability to solving planning problems.

Although some models are more general than the one presented here and allow for solving more complex problems like dynamic programs63, enumeration problems65, or the longest shortest path problem66, we are not aware of any model explicitly discussing biological plausibility, despite the need for more neurobiologically realistic models70. In fact, most of these approaches are far from being biologically plausible as they e. g. require additional artificial memory63 or a preprocessing step that changes the graph depending on the input data66. Also, the models of Muller et al.62 and, very recently, Aimone et al.64, which are biologically more plausible, do not discuss how a specific path can then be computed in the graph, even if the length of a path can be calculated64. In addition, some models try to describe actually observed wave propagation in the brain71,72.

In the following we discuss limitations of the presented model and potential avenues for further research.

Single-neuron vs. multi-neuron encoding

In our model, each point on a cortical map is represented by a single neuron and a distance on the map is directly encoded in a synaptic strength between two neurons. The graph of synaptic connections can therefore be considered as a coarse-grained version of the underlying manifold of stimuli. Yet such a single-neuron representation is possible only for manifolds of a very low dimension, since the number of points necessary to represent the manifold grows exponentially with each additional dimension. For tasks like bodily movement, where dozens of joints need to be coordinated, the number of neurons required to represent every possible posture in a single-neuron encoding is prohibitive. Therefore, it is desirable to encode manifolds of stimuli in a more economical way—for example, by representing each point of the manifold by a certain set of neurons. It is an open question how distance relationships between such groups of neurons could be encoded and whether the dynamics from our model could be replicated in such a scenario.

Embedding into a bigger picture

While the model focuses on the solution of graph traversal problems, it appears desirable to embed it into a broader context of sensory perception, decision making, and motion control in the brain. One particular question is how the hypothetical “puppet string mechanism”—which we postulated to connect proprioception and motion control—could be implemented in a neural substrate. Similarly, if our model provides an appropriate description of place cells and their role in navigation, the question arises how a shift in place cell activity is translated into appropriate muscle commands to propel the animal into the corresponding direction.

It is intriguing to speculate about a deeper connection between our model and object recognition: On the same neural substrate, our hypothetical waves might travel through a space of possible transformations, starting from a perceived stimulus and “searching” for a previously learned representative of the same class of objects. This could explain why recognition of rotated objects is much faster than the corresponding mental rotation task73: The former would require only one wave to travel through the cognitive map, while the latter would require many waves to move the bump of activity.

Conclusion

We have shown that a wide range of cognitive tasks, especially those that involve planning, can be represented as graph problems. To this end, we have detailed one possible role for the recurrent connections that exist throughout the brain as a computational substrate for solving graph traversal problems. We showed in which way such problems can be modeled as finding a short path from a start node to some target node in a graph that maps to a manifold representing a relevant task space. Our review of empirical evidence indicates that a theme of connectivity can be observed in the neural structure throughout (at least) the neocortex which is well suited to realize the proposed model.

Methods and experiments

The model described in the “Proposed model” section above treats the recurrent neural network as a discretized approximation to the manifold of stimuli. Thus, the problem of finding a short path through that manifold translates into a graph traversal problem in the corresponding graph of synaptic connections. In the following, the starting and target position of the planning process are denoted by \(s\) and \(t\), respectively.

Neuronal network setup—exemplary implementation of the model

Splitting the dynamics into two network layers

As described in the “Dynamics required for solving planning problems” section, for our numerical implementation of the model we separated the two different types of dynamics into distinct layers of neurons, the continuous attractor layer and the wave propagation layer. The split into two layers makes the model more transparent and ensures that parameter changes have limited and traceable effects on the overall dynamics. As an additional simplification, we do not explicitly model the feed-forward connections which drive the wave propagation layer, but rather directly activate certain neurons in this layer.

Activation in the continuous attractor layer C represents the start node \(s\) and, in the course of the simulation, will move towards the target node \(t\), which is permanently stimulated in the wave propagation layer P. Waves of activation travel from \(t\) across P. As soon as a wave front reaches a node in P that is connected to a node in proximity to the current activation in C, the activation in C is moved towards it. Thus, every arriving wave front pulls the activation in C closer to \(t\), forcing the activation to trace the wave propagation back to its origin \(t\).

In detail, these dynamics require a very specific network configuration which is described in the following. Figure 6 contains a general overview of the intra- and inter-layer connectivity used in the model and our simulations.

Figure 6

Connectivity of the neurons. For simplicity, this visualization only contains a 1D representation. In the wave propagation layer, excitatory synapses are drawn as solid arrows, dashed arrows indicate inhibitory synapses. Upon its activation, the central excitatory neuron stimulates a ring of inhibitory neurons that in turn suppress circles of excitatory neurons to prevent an avalanche of activation and support a circular wave-like expansion of the activation across the sheet of excitatory neurons. Furthermore, overlap between the active neurons in C and P is used to compute the direction vector \(\Delta (t)\) used for biasing synapses in C and thus shifting activity there.

Spiking neuron model in the wave propagation layer

In the performed experiments, the wave propagation layer P is constructed with an identical number of excitatory and inhibitory Izhikevich neurons20,21, which cover a regular square grid of \(41\times 41\) points on the manifold of stimuli.

The spiking behavior of each artificial neuron is modeled as a function of its membrane potential dynamics v(t) using the two coupled ordinary differential equations \(\frac{\mathrm {d}}{\mathrm {d}t}v = 0.04 v^2 +5 v + 140 - u + I\) and \(\frac{\mathrm {d}}{\mathrm {d}t}u = a\cdot (b v-u)\). Here, v is the membrane potential in mV, u an internal recovery variable, and I represents synaptic or DC input current. The internal parameters a (scale of u / recovery speed) and b (sensitivity of u to fluctuations in v) are dimensionless. Time t is measured in ms. If the membrane potential grows beyond the threshold \(v\ge {30}\,\hbox {mV}\), the neuron spikes and the variables are reset via \(v \leftarrow c\) and \(u \leftarrow u+d\). Again, c (after-spike reset value of v) and d (after-spike offset value of u) are dimensionless internal parameters.

Table 1 Parameters used in our simulations of the wave propagation layer P.

If not stated otherwise in the following, the parameters listed in Table 1a were used for the Izhikevich neurons in P. They correspond to regular spiking (RS) excitatory and fast spiking (FS) inhibitory neurons. In contrast to20, neuron properties were not randomized, to allow for reproducible analyses. The effect of a more biologically plausible, heterogeneous distribution of neuron properties and synaptic strengths is analyzed under “Numerical experiments” below. Compared to20, the coupling strength in P is large to account for the extremely sparse adjacency matrix, as every neuron is only connected to its few proximal neighbours in our configuration. Whenever a neuron in P is to be stimulated externally, a DC current of \(I=25\) is applied to it. As in20, the simulation time step was fixed to 1 ms with one sub-step in P for numerical stability.
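As a minimal sketch, the following Python/NumPy loop shows the update scheme described above for a population of excitatory neurons in P; the parameter values are the standard RS values from20 and merely stand in for those of Table 1a:

```python
import numpy as np

# Standard regular-spiking values from ref. 20, as placeholders for Table 1a.
a, b, c, d = 0.02, 0.2, -65.0, 8.0
n, dt = 41 * 41, 1.0           # grid size of P; time step of 1 ms, one sub-step

v = np.full(n, -65.0)          # membrane potentials in mV
u = b * v                      # recovery variables
I = np.zeros(n)
I[0] = 25.0                    # permanent DC stimulation of the target neuron t

for step in range(100):
    fired = v >= 30.0          # spiking threshold of 30 mV
    v[fired] = c               # after-spike reset of v
    u[fired] += d              # after-spike offset of u
    # (synaptic currents from the `fired` neurons would be added to I here)
    v += dt * (0.04 * v**2 + 5.0 * v + 140.0 - u + I)
    u += dt * a * (b * v - u)
```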

Synaptic connections in the wave propagation layer

As depicted in Fig. 6, the excitatory neurons are driving nearby excitatory and inhibitory neurons with a synaptic strength of

$$\begin{aligned} s_\mathrm {e\rightarrow {}e}(d)&{:}{=} {\left\{ \begin{array}{ll} \dfrac{s_\mathrm {e\rightarrow {}e}^\mathrm {(max)}}{d}, &{} \text {for } 0 < d\le d_\mathrm {e} \\ 0 , &{} \text {else} \end{array}\right. }, \end{aligned}$$
(1)

where \(s_\mathrm {e\rightarrow {}i}(d)\) is defined analogously. Here, d is the distance between nodes in the manifold of stimuli. For simplicity, we model this manifold as a two-dimensional square mesh with grid spacing \(\delta =1\) where some connections might be missing. The choice \(s\propto {1}/{d}\) represents the assumption that recurrent coupling will be strongest to nearest neighbours and will decay with distance. Note that (1) in particular implies \(s_\mathrm {e\rightarrow {}e}(0)=s_\mathrm {e\rightarrow {}i}(0)=0\), which prevents self-excitation. To restrict the model to localized interaction, we exclude interaction beyond a predefined excitation range \(d_\mathrm {e}\) and inhibition range \(d_\mathrm {i}\), respectively. Values of the parameters in the expressions for the synaptic strengths used in the simulations are given in Table 1c.

The inhibitory neurons suppress activation of the excitatory neurons by reducing their input current via synaptic strength

$$\begin{aligned} s_\mathrm {i\rightarrow {}e}(d)&{:}{=} {\left\{ \begin{array}{ll} s_\mathrm {i\rightarrow {}e}^\mathrm {(max)}, &{} \text {for } d = 0 \\ \dfrac{s_\mathrm {i\rightarrow {}e}^\mathrm {(max)}}{d}, &{} \text {for } 0 < d\le d_\mathrm {i} \\ 0 , &{} \text {else} \end{array}\right. }\quad . \end{aligned}$$
(2)
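In code, the two coupling kernels of Eqs. (1) and (2) can be written as below; the maximal strengths and ranges passed in are placeholders for the values of Table 1c:

```python
def s_exc(d, s_max, d_e):
    """Excitatory coupling, Eq. (1); s(0) = 0 prevents self-excitation."""
    return s_max / d if 0.0 < d <= d_e else 0.0

def s_inh(d, s_max, d_i):
    """Inhibitory coupling magnitude, Eq. (2); it is subtracted from the
    input current of excitatory neurons and also acts at d = 0."""
    if d == 0.0:
        return s_max
    return s_max / d if d <= d_i else 0.0
```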

Wave propagation dynamics

The described setup allows for wave-like expansion of neuronal activity from an externally driven excitatory neuron as shown in Fig. 7.

Figure 7

Activity patterns of the excitatory and inhibitory neurons on a \(101\times 101\) square neuron grid. Spiking neurons are shown as gray areas. One excitatory neuron at the grid center (arrow) is driven by an external DC current to regular spiking activity. Due to the nearest-neighbour connections, this activity propagates in patterns that resemble a circular wave structure. The inhibitory neurons prevent catastrophic avalanche-like dynamics by suppressing highly active regions.

With the capability of propagating signals as circular waves from the target neuron \(t\) across the manifold of stimuli in P, it is now necessary to set up a representation of the start neuron \(s\) in C. This will be done in the following subsection, before the coupling between P and C is described.

Neuron model for place cell dynamics

The continuous attractor layer C implements a sheet of rate-coding neurons that models the functionality of a network of place cells in the human hippocampus17,18 and thus represents the manifold of stimuli. As for the wave propagation layer, we use a square \(41\times 41\) grid of neurons for this layer. Activation in the continuous attractor layer appears as a bump, the center of which represents the most likely current location on the manifold of stimuli.

This bump of activation is used to represent the current position in the graph of synaptic connections representing the cognitive map. Planning in the manifold of stimuli thus amounts to moving the bump through the sheet of neurons, where each neuron can be thought of as one node in this graph. With respect, e. g., to the robot arm example in Fig. 1b, the place cell bump represents the current state of the system, i. e. the current angles of the arm’s two degrees of freedom. As the bump moves through the continuous attractor layer, and thus through the graph, the robot arm alters its configuration, creating a movement trajectory through the 2D space.

Synaptic connectivity to realize continuous attractor dynamics

Our methodology for modelling the continuous attractor place cell dynamics adapts the computational approach used in19 by including a computational consideration for synaptic connections between continuous attractor neurons and an associated update rule that depends on information from the wave propagation layer P.

The synaptic weight function connecting each neuron in the continuous attractor sheet to each other neuron is given by a weighted Gaussian. This produces graded activation of cells in the immediate neighbourhood of a given neuron and simultaneous inhibition of neurons that are further away, thus giving rise to the bump-shaped activity in the sheet itself. The mathematical implementation of these synaptic connections also allows for the locus of activation in the sheet to be shifted in a given direction, which is, in turn, how the graph implemented by this neuron sheet is traversed.

The matrix of synaptic weights \(w\in \mathbb {R}^{(N_x\times N_y)\times (N_x\times N_y)}\), whose entry \(w_{\vec {i},\vec {j}}\) connects a neuron at position \(\vec {i}=(i_x, i_y)\) to a neuron at position \(\vec {j}=(j_x, j_y)\), is given by

$$\begin{aligned} w_{\vec {i},\vec {j}}&{:}{=} J\cdot \exp \left( -\frac{1}{\sigma ^{2}}\left\Vert \left( \frac{i_x-j_x}{N_x},\frac{i_y-j_y}{N_y}\right) +\vec {\Delta }(t)\right\Vert ^{2}\right) -T\,. \end{aligned}$$
(3)

Here, J determines the strength of the synaptic connections, \(\Vert \cdot \Vert\) is the Euclidean norm, \(\sigma\) modulates the width of the Gaussian, T shifts the Gaussian by a fixed amount, \(\vec {\Delta }(t)\) is a direction vector which we discuss in detail later, and \(N_{x}\) and \(N_{y}\) give the size of the two dimensions of the sheet.

In order to update the activation of the continuous attractor neurons and to subsequently move the bump of activation across the neuron sheet, we compute the activation \(A_{\vec {j}}\) of the continuous attractor neuron \(\vec {j}\) at time \(t+1\) using

$$\begin{aligned} B_{\vec {j}}(t+1)&= \sum _{\vec {i}}A_{\vec {i}}(t)w_{\vec {i},\vec {j}}(t)\,, \end{aligned}$$
(4)
$$\begin{aligned} A_{\vec {j}}(t+1)&= (1-\tau )B_{\vec {j}}(t+1)+\tau \frac{B_{\vec {j}}(t+1)}{\sum _{\vec {i}} A_{\vec {i}}(t)}\,, \end{aligned}$$
(5)

where \(B_{\vec {j}}(t+1)\) accumulates the incoming current from all neurons to neuron \(\vec {j}\), and \(\tau\) is a fixed parameter that determines stabilization towards a floating average activity.
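A compact NumPy sketch of Eqs. (3)–(5) could look as follows; the constants are illustrative stand-ins for the tuned values of Table 2, and the activity A is a flattened vector over the \(N\times N\) sheet:

```python
import numpy as np

N, J, sigma, T, tau = 41, 1.0, 0.1, 0.05, 0.8   # illustrative values only

def weight_matrix(delta):
    """Eq. (3): Gaussian weights, shifted by the direction vector delta."""
    pos = np.indices((N, N)).reshape(2, -1).T / N        # normalized positions
    diff = pos[:, None, :] - pos[None, :, :] + delta     # ((i - j)/N) + Delta(t)
    return J * np.exp(-np.sum(diff**2, axis=-1) / sigma**2) - T

def step(A, delta):
    """Eqs. (4) and (5): accumulate input, then stabilize total activity."""
    B = A @ weight_matrix(delta)                         # B_j = sum_i A_i w_ij
    return (1.0 - tau) * B + tau * B / A.sum()
```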

Table 2 Parameters for the continuous attractor layer C.

Simulation parameters for the continuous attractor layer C are given in Table 2. They have been manually tuned to ensure the development of stable, Gaussian-shaped activity with an effective diameter of approximately twelve neurons in C.

As in19, a direction vector \(\vec {\Delta }(t)\in \mathbb {R}^2\) has been introduced in Eq. (3). It has the effect of shifting the synaptic weights in a particular direction, which in turn causes the location of the activation bump in the attractor layer to shift to a neighbouring neuron. In other words, it is this direction vector that allows the graph to be traversed by informing the place cell sheet from which direction the wave front in P is coming. Thus, all that remains for the completion of the necessary computations is to compute \(\vec {\Delta }(t)\) as a function of the propagating wave and the continuous attractor position.

Layer interaction—direction vector

The interaction between the wave propagation layer P and the continuous attractor layer C is mediated via the direction vector \(\vec {\Delta }(t)\). The direction vector is computed such that it points from the center of the bump of activity towards the center of the overlap between bump and incoming wave as follows. Let \(\mathcal {C}_t\) and \(\mathcal {P}_t\) denote the sets of positions of active neurons at time t in layer C and P, respectively. Note that each possible position corresponds to exactly one neuron in the wave propagation layer and exactly one neuron in the continuous attractor layer as they have the same spatial resolution in the implementation. Now let \(\mathcal {A}_{t}{:}{=} \mathcal {C}_t\cap \mathcal {P}_t\). Then,

$$\begin{aligned} \mathop {{\text {mean}}}\left( \mathcal {A}_{t}\right)&= \frac{1}{\left| \mathcal {A}_{t}\right| }\sum _{\vec {i}\in \mathcal {A}_{t}}\vec {i} \end{aligned}$$
(6)

is the average position of overlap. We compute the direction vector from the current position \(p_t\) of the central neuron in the continuous attractor layer activation bump to \(\mathop {{\text {mean}}}\left( \mathcal {A}_{t}\right)\) via

$$\begin{aligned} \vec {\Delta }(t)&= \mathop {{\text {mean}}}\left( \mathcal {A}_{t}\right) - p_t\,. \end{aligned}$$
(7)
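Continuing the sketch above, Eqs. (6) and (7) translate directly into code when the active neurons of each layer are represented as sets of (x, y) position tuples:

```python
import numpy as np

def direction_vector(active_C, active_P, bump_center):
    """Eqs. (6) and (7): vector from the bump center to the mean position of
    the overlap between bump (layer C) and wave front (layer P)."""
    overlap = active_C & active_P                    # A_t = C_t ∩ P_t
    if not overlap:
        return None                                  # no interaction this step
    mean_pos = np.mean(sorted(overlap), axis=0)      # Eq. (6): mean position
    return mean_pos - np.asarray(bump_center)        # Eq. (7)
```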

Layer interaction—recovery period

In order to prevent the wave from interacting with the back side of the bump in C and thus pulling it back again, we introduce a recovery period R of a few time steps after moving the bump. During R, which is selected as the ratio of bump size to wave propagation speed, \(\mathcal {A}_{t}\) is assumed to be empty, which prevents any further movement. In our experiments, we used \(R={12}\,\hbox {ms}\). As the bump had a diameter of eleven cells and the maximum wave propagation speed was one cell per ms, this allowed every wave front to interact with the bump at most once.
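The recovery period can be added as a thin wrapper around the direction-vector computation from the previous sketch; this is again an illustration, not the published implementation74:

```python
class GatedDirection:
    """Suppresses bump/wave interaction for R time steps after each shift,
    so that every wave front moves the bump at most once."""
    def __init__(self, R=12):                # R = 12 ms in our experiments
        self.R, self.blocked_until = R, -1

    def __call__(self, t, active_C, active_P, bump_center):
        if t < self.blocked_until:
            return None                      # A_t treated as empty during recovery
        delta = direction_vector(active_C, active_P, bump_center)
        if delta is not None:
            self.blocked_until = t + self.R  # bump was shifted: start recovery
        return delta
```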

Numerical experiments

In order to test the complex neuronal network configuration described in the previous sections and to study its properties and dynamics, we performed numerical experiments using multiple different setups. Source code used for our studies is published at74. Results of our simulations are presented in the “Results of the numerical experiments” section. In the following, we will add some more in-depth analyses on specific properties of the model as observed in the simulations.

Transmission velocity

In our setup, no synaptic transmission delay, as e. g. in75, is implemented. Since, due to the strong nearest-neighbour connectivity, a few presynaptic spiking neurons suffice to raise the membrane potential above threshold, the waves travel across P with a velocity of approximately one neuronal “ring” per time step, cf. Fig. 4. In contrast, the continuous attractor can only move a distance of at most half its width per incoming wave. Accordingly, its velocity is tightly coupled to the spike frequency of the stimulated neuron while still being bounded due to the recovery period R.

Obstacles and complex setups

In the S-shaped maze of Fig. 5a, the continuous attractor activity moves towards the target node \(t\) on a direct path around the obstacles. As the optimal path is more than two times longer than in Fig. 4, the time to reach the target is accordingly longer as well. This is also in line with the required travel times from \(s\) to \(t\) in Fig. 5b,c, where—despite its complexity—a path through the maze is found fastest since it is shorter than in the other cases of Fig. 5. This observation is consistent with the fact that our model is a parallelized version of BFS, cf. “Relation to existing graph traversal algorithms”, which is guaranteed to find a shortest path in an unweighted and undirected graph.

Heterogeneous neuron properties and synaptic strengths

In the simulation experiments described up to now, a homogeneous wave propagation layer P is employed. There, all neurons are subject to the same internal parameters, being either regular spiking excitatory neurons or fast spiking inhibitory neurons. Also, synaptic strengths are strictly set as described previously with parameters from Table 1c. This setup is rather artificial. Natural neuronal networks will exhibit a broad variability in neuron properties and in the strength of synaptic connectivity.

To account for this natural variability, we randomized the individual neurons’ internal properties as suggested in20, see Table 1b. As in20, heterogeneity is achieved by randomizing the neuron model parameters using random variables \(r_e\) and \(r_i\) for each excitatory and inhibitory neuron, respectively. These are uniformly distributed in the interval [0, 1] and vary the neuron models between regular spiking (RS, \(r_e=0\)) and chattering (CH, \(r_e=1\)) for excitatory neurons, and between low-threshold spiking (LTS, \(r_i=0\)) and fast spiking (FS, \(r_i=1\)) for inhibitory neurons. By squaring \(r_e\), the excitatory neuron distribution is biased towards RS. In addition, after initializing the synaptic strengths in P, we randomly varied them individually by up to \(\pm {10}\,\%\).
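A sketch of this randomization, assuming the parameter interpolation scheme of the original Izhikevich paper20 (standing in for Table 1b), might read:

```python
import numpy as np

rng = np.random.default_rng(seed=42)     # fixed seed for reproducible analyses
n = 41 * 41

# Excitatory neurons between RS (r_e = 0) and CH (r_e = 1); squaring r_e
# biases the distribution towards RS.
r_e = rng.uniform(0.0, 1.0, n)
a_exc, b_exc = np.full(n, 0.02), np.full(n, 0.2)
c_exc, d_exc = -65.0 + 15.0 * r_e**2, 8.0 - 6.0 * r_e**2

# Inhibitory neurons between LTS (r_i = 0) and FS (r_i = 1).
r_i = rng.uniform(0.0, 1.0, n)
a_inh, b_inh = 0.02 + 0.08 * r_i, 0.25 - 0.05 * r_i
c_inh, d_inh = np.full(n, -65.0), np.full(n, 2.0)

def jitter(S, rng):
    """Vary initialized synaptic strengths individually by up to ±10 %."""
    return S * rng.uniform(0.9, 1.1, size=S.shape)
```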

Figure 8

Block setup as in Fig. 5 but with a heterogeneous neuron configuration in P.

Despite this strong modification of the original, numerically ideal setup, structured wave propagation is still possible in P, as can be seen in Fig. 8. While the stereotypical circular form of the wave fronts dissolves in the simulation, they continue to traverse P completely. As before, they reach the continuous attractor bump and are able to guide it to their origin. Apparently, the overall connection scheme in P is more important for stable wave propagation than homogeneity in the individual synaptic strengths and neuron properties.

An interesting aspect of this simulation, when compared to Fig. 5b, is the apparent capability of solving the graph traversal problem more quickly than with the homogeneous neuronal network. This is an artifact of the explicitly broken symmetry in the heterogeneous configuration: The wave fronts from different directions differ in shape when arriving at the initial position of the continuous attractor layer activity. Thus, one of them is immediately preferred, and target-oriented movement of the bump starts earlier than before. This capability of breaking symmetries and thus quickly resolving ambiguous situations is an explicit advantage of the more biologically realistic heterogeneous configuration.