Introduction

It is well established that a balance between excitation and inhibition, usually denoted E-I balance, arises during spontaneous cortical activity, both in vitro1,2,3,4 and in the intact and spontaneously active cortex4,5,6,7. This balance, which refers to a relatively constant ratio between the excitatory and inhibitory inputs to a neuron, was theoretically predicted as a way to explain how cortical networks are able to sustain stable, though temporally irregular, and even chaotic, dynamics8,9,10. Since then, the ramifications of such a balanced state in terms of both dynamics and computation have been widely studied, showing how E-I balance results in critical-state dynamics of avalanches and oscillations11, with direct implications for the dynamic range12, the storage of information13, and the computational power14 of networks.

Recurrent neural networks can use E-I balance to generate asynchronous states even in the presence of strongly shared inputs15. Indeed, nearby cortical neurons with similar orientation tuning show low correlated variability, potentially simplifying the decoding of information by a population of such neurons16. Balanced networks have also been shown to operate in at least two distinct regimes, linking the richness of the internal dynamics, connectivity strength, and functionality: a weakly coupled state favoring information transmission, and a strongly coupled state, characterized by complex internal dynamics that could be employed for information processing17. By modulating the ratio between excitation and inhibition, it is furthermore possible to selectively switch information gating and rerouting between different circuits on and off18.

The direct link between E-I balance and information transmission, together with observations of an atypical excitation/inhibition ratio in neurobehavioral syndromes such as autism, has led to the hypothesis that an abnormal degree of E-I balance might underlie a series of psychiatric disorders19. Indeed, later causal experimental studies in mice have shown that elevating the ratio of excitation to inhibition above typical physiological levels produces a strong impairment of information processing and results in social deficits consistent with those of humans suffering from these conditions20.

It has been shown that networks of supralinear excitatory and inhibitory neurons, that is, of neurons whose non-linearities are purely expansive (no saturation) and which would therefore tend to exhibit unstable behavior, can be stabilized by an appropriate choice of connectivity matrix, resulting in stable, loosely balanced dynamics21. These networks, denoted stabilized supralinear networks (SSN), are able to capture a wide range of experimental findings on visual cortical neurons, including contextual modulation and normalization, spatial properties of intracortical connections22, as well as the stimulus dependence of neural variability23.

Different approaches have been taken in the past to construct balanced neural networks for numerical simulations. When van Vreeswijk and Sompolinsky introduced the balanced network model8, they constructed the connectivity matrix using sparse random connections, where the overall connection strength was forced to be inversely proportional to the square root of the number of connections. The conditions for the stability of the SSN have also been studied analytically21, and the weights in this type of network are typically selected so that the network is stable22,23. Balance has also been a topic of study in non-chaotic networks designed for the generation of complex movement. In these networks, termed “stability-optimized circuits” (SOCs)24, balance is achieved by an optimizer modifying inhibitory connections, together with a mechanism able to prune or add synapses.

These approaches did not, however, tackle the issue of how the brain could find such weight configurations. In particular, a key question is whether E-I balance in the brain is the result of genetically encoded synaptic strengths or, alternatively, whether ongoing internal synaptic adaption may lead to a dynamic configuration of balanced synaptic weights. In other words, is the distribution of synaptic weights given a priori, or is it the result of a self-organizing process?

Several studies in recent years have proposed the use of inhibitory synaptic plasticity (ISP) to attain balance (see25 for a comprehensive review). Among these, an important step towards understanding how balance can be achieved in an unsupervised fashion in the brain has been the work of Vogels et al.26, who showed in simulations how single neurons with constant (or controlled) E weights and plastic I weights, receiving an external stimulus, can attain balance, and also restore it after the excitatory synaptic weights are modified. Moreover, the authors also show how balance can be attained in random recurrent networks where only the connections from I cells to E cells are plastic via ISP.

Both excitatory and inhibitory connections, as well as the overall excitability of neurons, are plastic and constantly evolve together with the neural activity in the brain, with a variety of plasticity mechanisms operating at different timescales27,28,29,30,31. The key question we address here is whether balance could be attained under these conditions in the brain, in a completely unsupervised way. We find that this is indeed a plausible option. Namely, we show how simulated networks of non-linear excitatory and inhibitory neurons, evolving autonomously under local, online intrinsic and synaptic plasticity rules, generically achieve states that are balanced both with respect to the distribution of the synaptic weights and with respect to the inputs individual neurons receive. Hence we refer to this type of network, with continuously evolving synaptic and intrinsic parameters, as a Self-organized Plastic Balanced Network (SOPBN). No external optimization is employed, and the procedure is shown to be robust to the addition of external noise.

Methods

We consider autonomous Erdős-Rényi networks containing \(N\) neurons characterized by a linking probability \(p\). The membrane potential \(x_i\) of the rate-encoding neurons obeys

$${\dot{x}}_{i}=\frac{1}{\tau }({x}_{i}^{(inp)}-{x}_{i}),\,{x}_{i}^{(inp)}=\sum _{j}\,{w}_{ij}{y}_{j},\,{y}_{i}=\frac{1}{1+{{\rm{e}}}^{{b}_{i}-{x}_{i}}},$$
(1)

where \(y_i\) is the firing rate, \(b_i\) the threshold, and \(w_{ij}\) are the internal synaptic weights. There is no external input. In particular, no external source of noise is present in the main analysis of the system (we show in the Supplementary Material how these results are robust to the addition of a finite amount of external noise). The membrane time constant \(\tau\) is set to 10 ms for inhibitory and to 20 ms for excitatory neurons.
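As an illustration, the following minimal sketch integrates (1) with a forward Euler scheme. The authors report using C++ for their simulations; we use Python/NumPy here for brevity, and the step size, seed and placeholder weight initialization are our assumptions (the balanced, Dale-respecting initialization actually used in the Results is sketched in the Results section below).

```python
import numpy as np

rng = np.random.default_rng(0)

N, p = 400, 0.2                      # network size and linking probability
dt = 1.0                             # Euler step in ms (assumption)
is_exc = np.arange(N) < int(0.8 * N)
tau = np.where(is_exc, 20.0, 10.0)   # membrane time constants (ms)

# Erdos-Renyi adjacency; placeholder weights, not the balanced initialization
A = rng.random((N, N)) < p
w = np.where(A, rng.normal(0.0, 1.0, (N, N)), 0.0)

x = rng.normal(0.0, 1.0, N)          # membrane potentials x_i
b = np.zeros(N)                      # thresholds b_i

def rates(x, b):
    """Sigmoidal transfer function y_i = 1 / (1 + exp(b_i - x_i)) from (1)."""
    return 1.0 / (1.0 + np.exp(b - x))

for step in range(2000):             # 2 s of simulated time
    y = rates(x, b)
    x_inp = w @ y                    # x_i^(inp) = sum_j w_ij y_j
    x += dt / tau * (x_inp - x)      # Euler step of (1)
```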

The neural model we employ is described by a non-linear relation between membrane potentials and firing rates, and has been used in previous work32 to derive the Hebbian plasticity rules we employ later on. This transformation is expansive for low firing rates and saturates for very high rates. While a saturation of this type is unavoidable for any realistic biological system, cortical neurons are generally observed to operate in the low firing rate regime, where this saturation is not visible, and the transfer function is typically described by a threshold-powerlaw \(y\propto {\lfloor x\rfloor }^{n}\) with an exponent \(n\) between 1 and 533,34,35. We show however in Fig. 1 how, for low firing rates (encouraged by the intrinsic plasticity rule we employ), both functions are virtually indistinguishable.
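This near-equivalence at the foot of the sigmoid can be checked numerically. The sketch below fits a threshold-powerlaw \(c\,{\lfloor x-\theta\rfloor}^{2.5}\) to the transfer function of (1) over \(x \in [-5, 0]\); the threshold value \(b = 0\) and the fitting procedure are our assumptions (Fig. 1 does not specify them), so the resulting m.s.e. is only indicative of the value quoted in the figure.

```python
import numpy as np
from scipy.optimize import curve_fit

b = 0.0                                    # sigmoid threshold (assumption)
x = np.linspace(-5.0, 0.0, 200)
y_sig = 1.0 / (1.0 + np.exp(b - x))        # transfer function from (1)

def powerlaw(x, c, theta, n=2.5):
    """Threshold-powerlaw y = c * max(x - theta, 0)^n."""
    return c * np.clip(x - theta, 0.0, None) ** n

# fit only c and theta; the exponent n = 2.5 is held fixed
(c, theta), _ = curve_fit(powerlaw, x, y_sig, p0=(0.01, -6.0))
mse = np.mean((powerlaw(x, c, theta) - y_sig) ** 2)
print(f"fitted c = {c:.4f}, theta = {theta:.2f}, m.s.e. = {mse:.4f}")
```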

Figure 1

Left. In red, the nonlinear transfer function relating membrane potentials and rates in the neural model (1). The typical activity rates enforced by the intrinsic plasticity rule (\(y_t = 0.2\)) result in the neuron operating at the foot of the non-linearity, where it is practically indistinguishable (m.s.e. = 0.027 for \(x \in [-5, 0]\)) from a threshold-powerlaw with exponent \(n = 2.5\) (in blue), typically considered a suitable model for experimental findings in cortical neurons33,34,35. Right. The effective synaptic strength multiplier \(\phi(t)u(t)\) of the Tsodyks-Markram model (4). Here \(\beta = \alpha = 0.01\) and \(U_{max} = 4\) were used. The red/violet curves correspond to the values as measured for excitatory synapses in the medial prefrontal cortex of ferrets43 and for inhibitory layer 2–4 neurons of the somatosensory cortex of Wistar rats44, respectively. The presynaptic neuron is active for \(t \in [100, 300]\) ms, and inactive otherwise.

Adaption of the synaptic weights

The recurrent synaptic weights are continuously adapted using the multiplicative self-limiting Hebbian rule32

$${\dot{w}}_{ij}={\varepsilon }_{w}\,G({x}_{i})\,H({x}_{i}){y}_{j},\,G({x}_{i})={x}_{0}+{x}_{i}(1-2{y}_{i}),\,H({x}_{i})=2{y}_{i}-1+2{x}_{i}(1-{y}_{i}){y}_{i},$$
(2)

where the membrane potential \(x_i\) and the activity \(y_i\) of the postsynaptic neuron are related in this model via (1) by a deterministic sigmoidal transfer function. This allows us to write the functions \(G\) and \(H\) as functions of \(x_i\) only, where \(y_i\) is then simply shorthand for \(y_i(x_i)\). This update rule may be derived from an information-theoretical principle, the stationarity principle for statistical learning36, which states that the distribution function of the postsynaptic neural activity continuously evolves during the weight adaption process, becoming stationary only once learning is completed. Being autonomous, the network considered here is however not confronted with an explicit learning task. Learning denotes in our context therefore the unsupervised process of weight adaption, which minimizes in our case the Fisher information of the activity of the postsynaptic neuron32.

The limiting term \(G(x)\) in (2) changes sign when the postsynaptic activity \(y_i\) is either too large or too small in comparison with \(x_0\), hence reversing the Hebbian learning regulated in turn by \(H(x)\). This property of \(G(x)\) is useful for the learning rule as it prevents runaway synaptic growth, operating as an effective homeostatic synaptic plasticity mechanism mounted on top of the Hebbian part of the rule37. Our adaption rule, which is also denoted the flux rule32, is robust with respect to the actual value selected for the reference scale \(x_0\) of the membrane potential, as we checked by performing test runs with \(x_0 = 1\) and \(x_0 = 8\). For the simulations presented here we used \(x_0 = 4\).

We note that Hebbian learning rules like (2) are normally formulated not with respect to the bare presynaptic activities, but with respect to the deviation \(\delta y_j = y_j - \langle y_j\rangle \) of the presynaptic activity \(y_j\) from its time-averaged mean \(\langle y_j\rangle \). The adaption rule (2) performs in that case a principal component analysis, for which the signal-to-noise ratio increases with increasing \(x_0\)32, being otherwise sensitive to input directions \(y_j\) characterized by a negative excess kurtosis.

For the study presented further below we use the same adaption rule (2) for all synapses, whose self-limiting behavior stabilizes firing rates, rather than trying to reproduce a particular instance of the wide variety of experimentally observed phenomenological spike-timing dependent plasticity (STDP) rules for inhibitory connections25. That route would involve the introduction of not well-constrained parameters, transcending in addition the central aims of our investigation. We are interested here in investigating whether ongoing Hebbian plasticity and balanced asynchronous dynamics are compatible.

The threshold \(b_i = b_i(t)\) entering the transfer function in (1) sets, as usual, the average firing rates. Here we use

$${\dot{b}}_{i}={\varepsilon }_{b}({y}_{i}-{y}_{t})$$
(3)

as the adaption rule for the threshold, which reduces, for \(y \approx y_t = 0.2\), to the somewhat more extended expressions one may derive from homeostatic principles for neural activity38,39,40. For the adaption rates we used \(1/\varepsilon_b = 10\) s and \(1/\varepsilon_w = 100\) s.
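Taken together, one combined adaption step of the flux rule (2) and the threshold rule (3) can be sketched as follows (Python, continuing the integration sketch above; the per-millisecond discretization is our assumption).

```python
import numpy as np

x0, y_t = 4.0, 0.2
eps_w = 1.0 / 100e3              # 1/eps_w = 100 s, expressed per ms
eps_b = 1.0 / 10e3               # 1/eps_b = 10 s, expressed per ms

def plasticity_step(w, b, x, y, mask, dt=1.0):
    """One Euler step of the flux rule (2) and the threshold rule (3).

    mask: boolean adjacency matrix (A in the integration sketch);
    only existing links adapt.
    """
    G = x0 + x * (1.0 - 2.0 * y)                    # self-limiting factor G(x_i)
    H = 2.0 * y - 1.0 + 2.0 * x * (1.0 - y) * y     # Hebbian factor H(x_i)
    w += dt * eps_w * np.outer(G * H, y) * mask     # dw_ij ~ G(x_i) H(x_i) y_j
    b += dt * eps_b * (y - y_t)                     # intrinsic plasticity (3)
    return w, b
```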

Synaptic pruning

Dale’s law states that neurons are either excitatory or inhibitory, namely that \(w_{lj} w_{kj} \ge 0\) for all \(l\) and \(k\). For a Hebbian plasticity rule like (2) to respect Dale’s law, one needs to prune a synaptic connection whenever the respective \(w_{ij}\) changes sign. We do this every 1000 ms of mathematical simulation time, reinserting the pruned link with a weight corresponding to 10% of the corresponding average excitatory or inhibitory link weight. Test runs in which the pruned links were instead reinserted with a strength of 1% of the average yielded nearly identical results. For the reinsertion process the postsynaptic neuron \(i\) is connected to a random, previously unconnected presynaptic neuron \(m\), with the sign of the new link \(w_{im}\) respecting Dale’s law. There are two possible versions.

Annealed pruning

Links may change their sign, as the new presynaptic neuron \(m\) is selected freely. The overall numbers of excitatory and inhibitory links may then drift over the course of the simulation, with only the total connectivity remaining constant.

Frozen pruning

Links do not change their character, as the new presynaptic neuron \(m\) is selected only among those neurons which are of the same type as \(j\). From a biological perspective, frozen pruning would correspond to a separate reshuffling of GABA and glutamate receptors.

For the results presented here we considered frozen pruning; a sketch of this step is given below.
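The following is a minimal sketch of the frozen pruning step (applied every 1000 ms of simulation time); the exclusion of self-connections and the handling of the case without available reinsertion candidates are our assumptions.

```python
import numpy as np

def frozen_pruning(w, is_exc, rng):
    """Prune links violating Dale's law and reinsert them within the same class.

    w[i, j]: weight of the link from presynaptic j to postsynaptic i (0 = no link).
    is_exc[j]: True if presynaptic neuron j is excitatory.
    """
    sign = np.where(is_exc, 1.0, -1.0)          # required sign per presynaptic column
    mean_exc = np.abs(w[:, is_exc][w[:, is_exc] != 0]).mean()
    mean_inh = np.abs(w[:, ~is_exc][w[:, ~is_exc] != 0]).mean()

    bad_i, bad_j = np.where(w * sign[None, :] < 0.0)   # links that changed sign
    for i, j in zip(bad_i, bad_j):
        w[i, j] = 0.0                            # prune the offending link
        # candidate presynaptic neurons: same type as j, not yet connected to i
        cand = np.where(is_exc == is_exc[j])[0]
        cand = cand[(w[i, cand] == 0.0) & (cand != i)]
        if cand.size == 0:
            continue                             # no free slot (assumption)
        m = rng.choice(cand)
        new_w = 0.1 * (mean_exc if is_exc[j] else mean_inh)
        w[i, m] = new_w if is_exc[j] else -new_w # 10% of the class average
    return w
```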

Short-term synaptic plasticity

We also included short-term synaptic plasticity (STSP), a mostly presynaptically induced modulation of the synaptic efficacy lasting hundreds of milliseconds to seconds41. STSP may lead to both synaptic potentiation and depression, resulting respectively from an influx of Ca²⁺ ions into the presynaptic bulb and from a depletion of the available reservoir of neurotransmitters. These effects are captured within the Tsodyks-Markram model42 by two variables, \(u(t)\) and \(\phi(t)\), encoding respectively the presynaptic Ca²⁺ concentration and the number of vesicles with neurotransmitters. The transient plasticity rules

$${\dot{u}}_{j}=\frac{1-{u}_{j}}{{T}_{u}}+\alpha ({U}_{max}-{u}_{j}){y}_{j},\,{\dot{\phi }}_{j}=\frac{1-{\phi }_{j}}{{T}_{\phi }}-\beta {\phi }_{j}{u}_{j}{y}_{j},\,{\tilde{w}}_{ij}={w}_{ij}{\phi }_{j}(t){u}_{j}(t)$$
(4)

then describe the time evolution of the effective synaptic weight \({\tilde{w}}_{ij}\), which is proportional to the bare synaptic weight \(w_{ij}\), to the number of available vesicles \(\phi_j\), and to the vesicle release probability \(u_j\). In simulations where STSP is present, \({\tilde{w}}_{ij}\) replaces \(w_{ij}\) in (1). STSP is transient in the sense that both \(u_j\) and \(\phi_j\) relax to unity in the absence of presynaptic activity, \(y_j \to 0\). Typical time evolution curves for the synaptic efficacy multiplier \(\phi_j(t)u_j(t)\) are presented in Fig. 1.
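A corresponding Euler step for the two Tsodyks-Markram variables in (4) may look as follows (with the parameter values quoted further below; the per-millisecond discretization is again our assumption).

```python
import numpy as np

U_max, alpha, beta = 4.0, 0.01, 0.01
T_u, T_phi = 500.0, 200.0            # time scales in ms

def stsp_step(u, phi, y, dt=1.0):
    """One Euler step of the short-term plasticity variables in (4)."""
    du = (1.0 - u) / T_u + alpha * (U_max - u) * y
    dphi = (1.0 - phi) / T_phi - beta * phi * u * y
    return u + dt * du, phi + dt * dphi

# effective weights entering (1): w_eff[i, j] = w[i, j] * phi[j] * u[j]
# w_eff = w * (phi * u)[None, :]
```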

With the introduction of STSP, and distinguishing explicitly between E and I inputs, the driving current \({x}_{i}^{(inp)}\) defined in (1) generalizes to

$${x}_{i}^{(inp)}={x}_{i}^{(exc)}+{x}_{i}^{(inh)},\,{x}_{i}^{(exc)}=\sum _{j\in \{exc\}}\,{w}_{ij}{\phi }_{j}{u}_{j}{y}_{j},\,{x}_{i}^{(inh)}=\sum _{j\in \{inh\}}\,{w}_{ij}{\phi }_{j}{u}_{j}{y}_{j},$$
(5)

where {exc} and {inh} denote the sets of excitatory and inhibitory neurons, respectively. One can analogously define, via

$${\bar{w}}^{(exc)}=\frac{{\sum }_{i,j\in \{exc\}}{w}_{ij}{\phi }_{j}{u}_{j}}{{\sum }_{i,j\in \{exc\}}1},\,{\bar{w}}^{(inh)}=\frac{{\sum }_{i,j\in \{inh\}}{w}_{ij}{\phi }_{j}{u}_{j}}{{\sum }_{i,j\in \{inh\}}1}$$
(6)

the average excitatory and inhibitory effective synaptic weights.

We note that the original Tsodyks-Markram model42 describes STSP for the case of spiking neurons, and that one can derive (4) by assuming \(\alpha = \beta = 0.01\) and that a maximal neural activity of \(y_j \to 1\) corresponds to a firing rate of 40 Hz. Typical values for the time scales entering (4) are \(T_u = 500\) ms and \(T_\phi = 200\) ms for excitatory synapses in the medial prefrontal cortex of ferrets43, and \(T_u = 20\) ms and \(T_\phi = 700\) ms for inhibitory layer 2–4 neurons of the somatosensory cortex of Wistar rats44. It has been pointed out that these time scales are also relevant for behavioral control tasks45.

For our simulations we used \(U_{max} = 4\), \(\alpha = \beta = 0.01\), \(T_u = 500\) ms and \(T_\phi = 200\) ms for all synapses. We also performed control runs with \(T_u/T_\phi\) pairs of 500/200 ms and 20/700 ms for excitatory and inhibitory synapses, respectively, which led however only to minor quantitative changes.

Results

We are interested in investigating under which conditions an autonomous neural network, whose dynamics is described by (1), (2), (3) and (4), evolves towards a stable, irregular and balanced state (SOPBN). The results presented here correspond to networks of both excitatory and inhibitory neurons, where 80% of the neurons are excitatory and 20% inhibitory, and whose connections respect Dale’s principle, even when plasticity mechanisms are at play. We have taken membrane time constants of 20 and 10 ms for excitatory and inhibitory cells, respectively. As checks, we have also repeated the simulations with networks consisting of 50% excitatory and 50% inhibitory neurons, and with equal membrane time constants, observing no qualitative differences. Unless otherwise stated, we present results for a total number of neurons N = 400, a fixed 80% fraction of excitatory cells, a link probability p = 0.2 and a target average activity of \(y_t = 0.2\). The initial synaptic weights are drawn from Gaussians with means 7.5 (−30.0) and standard deviations 0.375 (1.5) for excitatory (inhibitory) synapses, respectively. Our simulations were performed in all cases with a C++ code running on a standard desktop computer; a Python sketch of the corresponding initialization is given below.
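The following stand-in sketches this initialization (the authors' code is in C++; the exclusion of self-connections is our assumption).

```python
import numpy as np

rng = np.random.default_rng(1)
N, p = 400, 0.2
N_exc = int(0.8 * N)                        # 80/20 E/I split
is_exc = np.arange(N) < N_exc

A = rng.random((N, N)) < p                  # Erdos-Renyi links
np.fill_diagonal(A, False)                  # no self-connections (assumption)

w = np.zeros((N, N))
w[:, is_exc] = rng.normal(7.5, 0.375, (N, N_exc))       # excitatory columns
w[:, ~is_exc] = rng.normal(-30.0, 1.5, (N, N - N_exc))  # inhibitory columns
w *= A                                      # keep only the existing links

tau = np.where(is_exc, 20.0, 10.0)          # membrane time constants (ms)
y_t = 0.2                                   # target average activity
```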

Rate-encoding neurons with asynchronous activity spikes

We find that the SOPBN tends to evolve to an irregularly bursting state characterized by time scales of the order of 100–200 ms. The data presented in Fig. 2 illustrate typical two-second intervals of activity, as obtained directly after initialization and after one hour of mathematical simulation time. They show the following:

  • The system state is very different at the beginning and after one hour: while some neurons are constantly quiet or constantly active directly after initialization, the network exhibits pervading bursts after evolving for one hour.

  • The mean excitatory \(\langle {x}_{i}^{(exc)}\rangle \) and inhibitory \(\langle {x}_{i}^{(inh)}\rangle \) inputs a neuron receives are both large in magnitude. The substantially smaller value of the overall mean input expresses E-I balance. This E-I balance is present for arbitrary timeframes within the system’s evolution. Averaged over time we have

    $$\begin{array}{c}{\langle {x}_{i}^{(exc)}\rangle }_{t=0{\rm{s}}}\approx 144.8,\,{\langle {x}_{i}^{(inh)}\rangle }_{t=0{\rm{s}}}\approx -\,147.3,\,{\langle {x}_{i}^{(exc)}\rangle }_{t=0{\rm{s}}}+{\langle {x}_{i}^{(inh)}\rangle }_{t=0{\rm{s}}}\approx -\,2.5,\\ {\langle {x}_{i}^{(exc)}\rangle }_{t=1{\rm{h}}}\approx 41.9,\,{\langle {x}_{i}^{(inh)}\rangle }_{t=1{\rm{h}}}\approx -\,44.1,\,{\langle {x}_{i}^{(exc)}\rangle }_{t=1{\rm{h}}}+{\langle {x}_{i}^{(inh)}\rangle }_{t=1{\rm{h}}}\approx -\,2.2,\end{array}$$

     for the system at the two different times, where the brackets now denote averages over both the network and time.

  • Deviations from the average E-I balance lead to large swings in the membrane potentials and hence to sharp activity spikes, a remarkable feature for rate-encoding neurons evolving continuously in time according to (1).

  • Bursts in the late network state involve the entire network: all excitatory and inhibitory neurons are active one or more times during a burst, as we have checked. The activities of the individual neurons are, however, asynchronous.

Figure 2

Selected activities and the average input current for an autonomous SOPBN containing 320 (80) excitatory (inhibitory) rate-encoding neurons, as defined by (1), (2), (3) and (4). The target activity is \(y_t = 0.2\). The time interval is two seconds; the system is shown for the first two seconds after initialization (left), and for two seconds after a previous evolution of 3600 seconds of simulated biological time (right). Shown are the activities of three random excitatory (red) and inhibitory (blue) neurons, together with the averaged E-I balance. The E-I balance is given here in terms of the network averages of the excitatory \(\langle {x}_{i}^{(exc)}\rangle \) and inhibitory \(\langle {x}_{i}^{(inh)}\rangle \) inputs (red and blue curves), as defined by (5). Their sum (green) is substantially smaller in magnitude for both time intervals. While the neural activities initially span a large spectrum, ranging from nearly completely silent to almost constantly firing neurons, the activity after 1 h shows irregular bursts characterized by asynchronous neural activities. Note that the large fluctuations in the inputs of the rate-encoding neurons making up the network induce ‘spike-like’ activities.

We also examined the E-I balance \({x}_{i}^{(exc)}+{x}_{i}^{(inh)}\) for individual neurons, obtaining results very close to the network averages shown in Fig. 2. A detailed analysis of the corresponding cross correlations is presented further below.

In21 the authors compare the degree of cancellation (or tightness of the balance) between the van Vreeswijk and Sompolinsky balanced networks and the SSN, showing that while the first kind requires a very high degree of cancellation, the SSN can operate in a regime of loose balance. These networks have, however, constant synaptic weights and intrinsic parameters. In SOPBNs, where several parameters are plastic, we observe that while the network maintains a high degree of balance most of the time (with correlations close to unity, as shown in Fig. 7), this tightness is transiently broken to allow for bursts of activity.

Autonomous networks with balanced and increasingly large, but otherwise random, synaptic weight distributions are known to produce a chaotic state in the thermodynamic limit9. Testing this prediction, we considered the non-adapting case \(\varepsilon_b = \varepsilon_w = 0\). Additionally switching off short-term synaptic plasticity, we find that an N = 400 network leads, depending on the initial weight distribution, either to fixpoints, to limit cycles, or to states of highly irregular activity. We did not, however, try to determine the relative incidence rates of these three states. The two types of irregular spiking states illustrated in Fig. 3, resulting respectively from adapting and from non-adapting dynamics, differ with respect to activity bursts (which are observed also in Fig. 2): these are conspicuously absent in our non-adapting networks.

Figure 3

The superimposed 10 s traces of the membrane potentials of a network of N = 400 neurons with a relative fraction of excitatory to inhibitory neurons of 80:20. Left: After a simulation of 3600 s for the same SOPBN considered in Figs 2 and 4. Right: For a network with only short-term synaptic plasticity, namely with \(\varepsilon_b = 0 = \varepsilon_w\). Note that the synaptic weights are in this case as drawn from the initial distribution, which is balanced, with means 7.5 (−30.0) and widths 0.375 (1.5) for excitatory (inhibitory) synapses, respectively. Additionally turning off short-term synaptic plasticity changes the irregular state only quantitatively.

As a note, these irregular spiking states show signs of corresponding to a transient chaotic state (see the subsection Analysis of the irregular activity in the Supplementary Material).

Evolution of balanced synaptic weights

We present in Fig. 4 the evolution of the network averages (6) of the synaptic weights. We find that the Hebbian plasticity rule (2) renormalizes the synaptic weights while approximately retaining the balance

$${f}_{exc}{\bar{w}}^{(exc)}\langle {y}_{i}^{(exc)}\rangle \approx {f}_{inh}{\bar{w}}^{(inh)}\langle {y}_{i}^{(inh)}\rangle ,\,4{\bar{w}}^{(exc)}\approx -\,{\bar{w}}^{(inh)}$$
(7)

between the mean excitatory \({\overline{w}}^{(exc)}\) and the mean inhibitory \({\overline{w}}^{(inh)}\) weights, where we have denoted with \(f_{exc}/f_{inh}\) and \(\langle {y}_{i}^{(exc)}\rangle /\langle {y}_{i}^{(inh)}\rangle \) the fractions and the mean activities of excitatory and inhibitory neurons. For the present study we have \(\langle {y}_{i}^{(exc)}\rangle =\langle {y}_{i}^{(inh)}\rangle ={y}_{t}\). The second relation in (7) refers to 80/20 networks, which contain four times as many excitatory as inhibitory neurons: with \(f_{exc} = 0.8\), \(f_{inh} = 0.2\) and equal mean activities, the first relation reduces to \(4{\bar{w}}^{(exc)}\approx -{\bar{w}}^{(inh)}\).

  • The balance presented in Fig. 4 is not perfect, with the inhibitory weights slightly dominating in the long run.

  • We also considered networks for which the initial weight distribution was strongly unbalanced, finding that the adaption rule (2) nevertheless leads to balanced mean synaptic weights. We discuss the self-organization of E-I balance in more detail further below, for the case of 50/50 networks.

Figure 4

Time evolution of the average effective excitatory (violet) and inhibitory (green) synaptic weights \({\overline{w}}^{(exc)}\) and \({\overline{w}}^{(inh)}\), as defined by (6). The network contains 320 excitatory and 80 inhibitory neurons. Also shown is the average balanced weight (red, enlarged in the insets), given by \(4{\overline{w}}^{(exc)}+{\overline{w}}^{(inh)}\). Left: With short-term plasticity. Right: Without short-term plasticity, namely for \(\phi_j \equiv 1\) and \(u_j \equiv 1\).

In Fig. 5 the full distribution of synaptic weights is presented, with the results obtained from a 3600 s simulation contrasted with the initial weight distribution. It is evident that the redistribution of synaptic weights is substantial, reaching far beyond the simple overall rescaling of the means presented in Fig. 4. The excitatory weights, and to a certain extent also the inhibitory weights, tend to pile up at the pruning threshold, which has been set to zero. Trying exponential and log-normal fits, we found that the excitatory weight distribution follows a log-normal distribution fairly well; a sketch of such a comparison is given below.
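The paper does not specify the fitting procedure; one hedged way to compare the two candidate distributions is via maximum-likelihood fits, as in this sketch, where `w_exc` stands in for the positive excitatory weights obtained after the 3600 s run.

```python
import numpy as np
from scipy import stats

# stand-in data; in practice w_exc holds the excitatory weights after 3600 s
w_exc = np.random.default_rng(2).lognormal(1.0, 0.8, 10_000)

shape, loc, scale = stats.lognorm.fit(w_exc, floc=0.0)   # log-normal fit
loc_e, scale_e = stats.expon.fit(w_exc, floc=0.0)        # exponential fit

# compare goodness of fit via the total log-likelihood (higher is better)
ll_lognorm = stats.lognorm.logpdf(w_exc, shape, loc, scale).sum()
ll_expon = stats.expon.logpdf(w_exc, loc_e, scale_e).sum()
print(f"log-likelihood: lognorm = {ll_lognorm:.1f}, expon = {ll_expon:.1f}")
```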

Figure 5

Top: The distribution of the synaptic weights \(w_{ij}\) for N = 400 site networks with a link probability of p = 0.2 and 80% excitatory and 20% inhibitory neurons. Bottom: The same for N = 3200 neurons. Shown in the right panels are the histograms of the initial distribution (violet, top part truncated) and of the distribution obtained after a mathematical simulation time of 3600 seconds. Black: the overall distribution of synaptic weights and (reflected with respect to the x-axis) the individually normalized partial distributions (excitatory/inhibitory) → (excitatory/inhibitory) neurons. The respective means μ and standard deviations σ of the partial distributions are given in the panels on the right.

System size and simulation time effects

The comparison between networks with N = 400 and N = 3200 presented in Fig. 5 shows that the overall functional form of the weight distribution changes qualitatively for the inhibitory weights, but not for the excitatory weights. The small additional peak visible for N = 3200 in the inhibitory links corresponds to the synaptic weights of the links reinserted after pruning.

The mean weights, which are also presented in Fig. 5, scale down with increasing system size. For the data presented in Fig. 5 the connection probability is p = 0.2 for both N = 400 and N = 3200. It is then an interesting question which kind of scaling autonomous Hebbian learning would produce. Our attempts to determine how the synaptic weights scale with respect to the mean number of afferent synapses Z = pN were, however, not successful. For the data presented in Fig. 5 we note that the ratio of the mean synaptic weights is about a factor of two between N = 400 and N = 3200, while the corresponding ratio of Z is 1/8.

Comparing weight distributions at a fixed simulation time is not meaningful for systems, such as our SOPBN, that do not stop evolving. Average weights continue to drop even in long-term simulations, as evident in part in Fig. 4. We find that the system switches to a new state (characterized either by limit cycles, fixpoints, or very long quiet periods) after extended transients, which are at least of the order of several hours. The irregular state observed, as in Fig. 3, corresponds therefore to a transient state. The transients last, however, orders of magnitude longer than the time scales relevant for information processing in biological networks, which range typically from milliseconds to seconds.

Self-organized balanced synaptic weights

The results presented hitherto in Figs 2, 3, 4 and 5 were obtained for 80/20 systems in which the initial synaptic weights had been drawn from balanced distributions. Going one step further, we now examine whether the Hebbian plasticity rule (2) is able to transform a non-balanced weight distribution into a balanced one.

We present in Fig. 6 the evolution of the synaptic weights for a 50/50 system, for which the initial synaptic weights had been drawn from Gaussians with means 7.5 (−15.0) and standard deviations 0.375 (1.5) for excitatory (inhibitory) synapses, respectively. One notices that the autonomous Hebbian learning rule (2) balances the initially unbalanced synaptic weight distribution as fast as possible, that is, on the timescale \(1/\varepsilon_w = 100\) s. Equivalent results were obtained for initially unbalanced 80/20 systems.

Figure 6

Left: The evolution of the effective synaptic weights, as in Fig. 4, but for 200 excitatory and 200 inhibitory neurons. The membrane integration time in (1) is set to τ = 20 ms for both excitatory and inhibitory neurons. Synaptic weight balance (7), as expressed by \({\overline{w}}^{(exc)}+{\overline{w}}^{(inh)}\) (red curve), is achieved on the time scale \(1/\varepsilon_w = 100\) s of Hebbian learning (see inset). Note that the initial synaptic weight balance has been selected to be off by a factor of two. Right: The synaptic weight distributions, as in Fig. 5, obtained after one hour of mathematical simulation time. The two small peaks are located at the value of the weights reinserted after pruning. The final distributions are symmetric, apart from some stochastic fluctuations, with standard deviations of 2.7 and means of ±4.1 for excitatory and inhibitory neurons. Note that the initial weight distribution (violet) is highly unbalanced.

Figure 7

The E-I cross-correlation between excitatory and inhibitory inputs for a 50/50 system with N = 400 neurons. Shown is \(|\rho^{\pm}| = -\rho^{\pm}\), as defined in (10), measured either after 1 hour (gray bars) or right at the start (green bars). For the time average a period of 10 s was used in both cases. The error bars were evaluated with respect to 100 initial weight configurations, drawn each time from Gaussians with means 7.5 (−15.0) and standard deviations 0.375 (1.5) for excitatory (inhibitory) synapses, respectively. The initial synaptic weight configuration is therefore not balanced (as for Fig. 6). Shown are the results for distinct scenarios with Hebbian plasticity (Hebb), short-term synaptic plasticity (STSP) and intrinsic plasticity (intrinsic) being either turned on (green checkmark) or off (red cross).

The distribution of synaptic weights self-organizes, as evident from the data presented in Fig. 6, becoming fully symmetric within one hour of Hebbian adaption. The same is found for initially non-balanced 80/20 networks (not shown), for which the final synaptic weight distribution is also balanced, albeit non-symmetric.

Would any Hebbian learning rule lead to balanced synaptic weights?

A range of distinct synaptic plasticity rules are Hebbian in the sense that they perform a principal component analysis (PCA) whenever one direction in the space of input activities presents a larger variance than all other input directions32. Examples are the flux rule (2), which may be derived from the stationarity principle for statistical learning36, and Oja’s rule46,

$${\dot{w}}_{ij}={\varepsilon }_{\mathrm{oja}}\,{y}_{i}({y}_{j}{\phi }_{j}{u}_{j}-\alpha \,{y}_{i}{w}_{ij}),\,\alpha =0.1.$$
(8)

In order to work with average synaptic weight changes \(\langle {\dot{w}}_{ij}\rangle \) of comparable magnitude, one needs to rescale the adaption rate \({\varepsilon }_{\mathrm{oja}}\) with respect to the rate \({\varepsilon }_{w}\) entering the flux rule (2). We use \({\varepsilon }_{\mathrm{oja}}=10\,{\varepsilon }_{w}\).
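The corresponding update step for Oja's rule (8), including the short-term plasticity factors \(\phi_j u_j\), may be sketched as follows (same conventions as the flux-rule sketch in the Methods).

```python
import numpy as np

eps_w = 1.0 / 100e3          # flux-rule adaption rate, per ms
eps_oja = 10.0 * eps_w       # rescaled as stated in the text
alpha = 0.1

def oja_step(w, y, phi, u, mask, dt=1.0):
    """One Euler step of Oja's rule (8) with STSP factors phi_j u_j."""
    pre = y * phi * u                                 # effective presynaptic activity
    dw = np.outer(y, pre) - alpha * (y ** 2)[:, None] * w
    return w + dt * eps_oja * dw * mask
```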

In Fig. 8 the time evolution of the average excitatory and inhibitory synaptic weights, as produced by Oja’s rule (8), is presented. Oja’s rule leads to a complete rescaling of the inhibitory weights and hence to a maximally unbalanced synaptic weight distribution, which is furthermore characterized by intermittent periods of abrupt changes.

Figure 8

Time evolution of the average effective excitatory (violet) and inhibitory (green) synaptic weights \({\overline{w}}^{(exc)}\) and \({\overline{w}}^{(inh)}\), as defined by (6). The network contains 200 excitatory and 200 inhibitory neurons. Also shown is the average balanced weight (red), given by \({\overline{w}}^{(exc)}+{\overline{w}}^{(inh)}\). Left: Using Oja’s rule (8). Right: Using the flux rule (2), as for Fig. 6, but this time with the limiting factor \(G(x) = x_0 + x(1-2y)\) replaced by a constant, \(G \to 10\). Both approaches fail to produce a balanced synaptic weight distribution.

Synaptic weight growth is limited by both Oja’s rule and the flux rule: by the additive damping factor in the case of Oja’s rule (8), and by the multiplicative limiting factor \(G(x) = x_0 + x(1-2y)\) in the case of the flux rule (2). For comparison we performed simulations in which we replaced \(G(x)\) in (2) by a constant. We find in this case that the excitatory weights are rescaled to zero; the synaptic weight distribution is therefore also maximally unbalanced. The runaway growth of the inhibitory synaptic weights showing up in Fig. 8, which is due to the removal of the limiting factor \(G(x)\) in (2), is accompanied by a corresponding evolution of the threshold, via (3), such that the average activity remains close to \(y_t = 0.2\).

The flux rule (2) is manifestly a function only of the membrane potential \(x_i\) and of the effective presynaptic activity \(\phi_j u_j y_j\), which is in turn positive. Its overall functional form follows closely that of a cubic polynomial36,

$${\dot{w}}_{ij}\approx -\,{\varepsilon }_{w}({x}_{i}-b/2)\,({x}_{i}-{x}^{-})\,({x}_{i}-{x}^{+})\,{\phi }_{j}{u}_{j}{y}_{j},\,{x}^{\pm }\approx -\,\frac{{b}_{i}}{2}\pm {x}_{0},$$
(9)

where \({x}^{\pm }\) denote the roots of \(G(x) = x_0 + x(1-2y)\). Stationarity is achieved when the time average of (9) vanishes, that is, when the average membrane potential \(\langle x_i\rangle \) is of the order of the roots \({x}^{\pm }\) and \(b/2\) of \(G(x)H(x)\).

  • The threshold \(b\), which is determined via the sigmoidal (1) by the target activity \(y_t\), is of order unity whenever this is the case for the average membrane potential \(\langle x_i\rangle \).

  • It is, vice versa, true that the average membrane potential \(\langle x_i\rangle \) will be of order unity, as long as this is the case for the roots \({x}^{\pm }\) and \(b/2\) of \(G(x)H(x)\).

These two conditions are mutually compatible. It is therefore not surprising that the flux rule leads on average to small membrane potentials, as evident in Fig. 3, and consequently also to approximately balanced synaptic weight distributions. We note, in contrast, that Oja’s rule (8) depends explicitly, in addition, on the weight \(w_{ij}\) of the adapting synapse.

We conclude that not every Hebbian learning rule will produce balanced irregular dynamics. While we have pointed out some differences between the flux rule and Oja’s rule, which may hint at the conditions a rule must fulfill to achieve this state, further work is necessary to determine which families of rules can and cannot perform this task.

E-I balance in terms of E-I correlations

To quantify the degree of balance between excitation and inhibition, we compute for a given neuron the cross-correlation \({C}_{i}^{\pm }\) between the total excitatory incoming synaptic current \({x}_{i}^{(exc)}\), as defined by (5), and the total inhibitory synaptic current \({x}_{i}^{(inh)}\), averaging first with respect to time and then across all neurons of the network:

$${C}_{i}^{\pm }=\frac{{\langle ({x}_{i}^{(exc)}-{\langle {x}_{i}^{(exc)}\rangle }_{t})({x}_{i}^{(inh)}-{\langle {x}_{i}^{(inh)}\rangle }_{t})\rangle }_{t}}{{\sigma }_{i}^{(exc)}{\sigma }_{i}^{(inh)}},\,{\rho }^{\pm }={\langle {C}_{i}^{\pm }\rangle }_{i}.$$
(10)

Here we have denoted with \({\sigma }_{i}^{(exc)}\) and \({\sigma }_{i}^{(inh)}\) the standard deviations of \({x}_{i}^{(exc)}\) and \({x}_{i}^{(inh)}\), respectively.
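Given sampled time series of the two input currents from (5), the measure (10) reduces to a per-neuron Pearson correlation followed by a network average, as in this sketch; the arrays `x_exc` and `x_inh` of shape (T, N) are assumed to hold \({x}_{i}^{(exc)}\) and \({x}_{i}^{(inh)}\) over the 10 s window.

```python
import numpy as np

def ei_correlation(x_exc, x_inh):
    """Compute C_i per neuron and the network average rho, as in (10).

    x_exc, x_inh: arrays of shape (T, N), time samples of the separated inputs.
    """
    dx_e = x_exc - x_exc.mean(axis=0)       # deviations from the time averages
    dx_i = x_inh - x_inh.mean(axis=0)
    cov = (dx_e * dx_i).mean(axis=0)        # time-averaged covariance per neuron
    C = cov / (x_exc.std(axis=0) * x_inh.std(axis=0))
    return C, C.mean()                      # C_i and rho
```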

In Fig. 7 we present the cross-correlation \(|\rho^{\pm}|\) for the 50/50 system discussed in Fig. 6, for which the initial weight configurations are not balanced. Note that the time scale for Hebbian learning is \(1/\varepsilon_w = 100\) s, which is an order of magnitude larger than the interval of 10 s used for evaluating \(\rho^{\pm}\) via (10). Analogous investigations for an 80/20 system can be found in the Supplementary Material in Fig. S3.

The cross-correlation characterizing the E-I balance of the initial state depends only marginally on whether short-term and/or intrinsic plasticity are active. Its surprisingly large overall value, about 45–50%, reflects the presence of substantial inter-neuronal activity correlations, which we did not investigate further. Comparing with the data presented in Fig. 6, one notices that \(\rho^{\pm}\) is a somewhat less sensitive yardstick for E-I balance than the bare synaptic weight balance, which renormalizes to small values in a balanced state. The data shown in Fig. 7 confirm otherwise that the Hebbian plasticity rule (2) leads to a highly balanced state.

We have so far considered networks without any external noise, which would not be the case in the brain. A state characterized by irregular neural activity is generically expected to be robust against moderate noise levels. Performing simulations with additive input noise, characterized by zero mean and a standard deviation of 5–10% with respect to the mean of the bare input, we found this expectation to hold. The cross-correlation \(\rho^{\pm}\) barely changes as long as the noise level remains moderate. The situation changes gradually with increasing noise strength, with E-I balance breaking down when the noise level reaches about 50% of the bare input strength (cf. Fig. S1 in the Supplementary Material).

Discussion

We have examined the question of whether it would be plausible for a neural network in which both intrinsic and synaptic parameters (for E as well as I connections) are continuously evolving to achieve balance, both in terms of weights and of activities, in a fully unsupervised way, finding that this is indeed possible. The resulting balanced network (which we have denoted here SOPBN) arises in a self-organized fashion, in analogy to the critical state possibly characterizing certain aspects of cortical dynamics47. We studied for this purpose the influence of continuously ongoing Hebbian plasticity within autonomous networks of rate-encoding neurons, finding that the synaptic plasticity rule that follows from the stationarity principle of statistical learning, the flux rule, does indeed induce a balanced synaptic weight distribution, even when the initial distribution is strongly unbalanced.

E-I balance induced by Hebbian learning

Comparing the flux rule, with and without the self-limiting term, and Oja’s rule, we found that Hebbian learning leads to a balanced distribution of synaptic weights, and hence also to a balanced state, whenever the learning rule favors small average membrane potentials. It is not necessary, for this to happen, that the learning rule constrains the overall input to strictly vanish on average; it suffices that the time-averaged input remains of the order of the neural parameters, such as the inverse slope of the transfer function in (1). We found that the flux rule, as defined by (2) and (9), fulfills this requirement. An example of a Hebbian rule not leading to a balanced weight distribution is, on the other hand, given by Oja’s rule (8).

Rate-encoding neurons showing spike-like neural activity

An E-I balanced state is characterized, in addition to the small average membrane potential, by the near cancellation of two large drivings in the form of large excitatory and inhibitory inputs. Such a state is highly sensitive to small imbalances resulting either from additional external signals or from internal fluctuations. We find these imbalances to be strong enough in SOPBNs to induce short spike-like bursts in the neural activity, as observed e.g. in Fig. 2. This is quite remarkable, as one could have expected the rate-encoding neurons used for the present study to lead to slowly, and hence smoothly, varying dynamical states.

Asynchronous neural activity

The near cancellation of large excitatory and inhibitory drivings stabilizes asynchronous neural activity, as illustrated in Fig. 3 in terms of the membrane potentials. Using the 0–1 test for chaos48, we found the asynchronous state in SOPBNs to be at least strongly irregular (cf. Fig. S2 in the Supplementary Material). As further indicators for chaos one could have analyzed the time intervals between activity spikes49 or the Lyapunov exponents of the system. The observation that the synaptic weight distribution changes continuously over time scales of hours, as demonstrated in Fig. 6, proves in any case that the neural activity is irregular on extended time scales. The limit of infinitely long times is not the focus of this study, as real neural systems are not expected to function for prolonged periods in the absence of stimuli.

Absence of a stationary autonomous state

We find, as shown in Fig. 4, that the magnitude of the mean synaptic weights decays slowly but continuously. Experimenting with different ensembles of initial weight statistics, we found no instance where Hebbian learning retaining E-I balance would lead to a systematic increase in the magnitude of the overall mean synaptic weights. We note, however, that this observation holds only for the case considered here of isolated networks, hence without an additional external driving. An adaption rate \(\varepsilon_w\) that fades out slowly, being large only initially, would also preempt the long-term decay of the average synaptic weights.

Theory vs. experiment

The dynamic balance of excitation and inhibition is observed experimentally within a range of distinct settings1,5. Multielectrode recordings in human and monkey neocortex suggest that E-I balance is caused in essence by local recurrent activity50, and not by external inputs, with irregular bursting activity showing up on a range of time scales that starts, as for SOPBNs, at a few hundred milliseconds. It is also interesting that the independent adjustment of synapses connecting inhibitory neurons to layer 2/3 pyramidal neurons in the mouse primary visual cortex has been found to be key for E-I balance to occur at the single-neuron level51. These findings concur with the results for the single-neuron cross-correlation presented in Fig. 7, for which the network average was performed only as a second step. Furthermore, we note that both the self-organized bursting states observed in SOPBNs (see Fig. 6) and the alternating up and down states observed in vitro in prefrontal and occipital ferret slices are characterized by the asynchronous participation of all neurons2.

Outlook

Which configuration of synaptic weights results from continuously ongoing internal Hebbian learning? We have presented here a first inroad into this subject, focusing in particular on the self-organized emergence of E-I balance in terms of large but nearly canceling excitatory and inhibitory inputs. We find that not all self-limiting Hebbian plasticity rules are able to do the job. There is, on the other hand, no need for a Hebbian learning rule to enforce E-I balance explicitly. We find that E-I balance already emerges when the Hebbian learning rule favors membrane potentials which are small with respect to the variance of the inputs, while nevertheless being large enough to be relevant for the neural transfer function.