Abstract
Spontaneous brain activity is characterized in part by a balanced asynchronous chaotic state. Cortical recordings show that excitatory (E) and inhibitory (I) drivings in the EI balanced state are substantially larger than the overall input. We show that such a state arises naturally in fully adapting networks which are deterministic, autonomously active and not subject to stochastic external or internal drivings. Temporary imbalances between excitatory and inhibitory inputs lead to large but shortlived activity bursts that stabilize irregular dynamics. We simulate autonomous networks of rateencoding neurons for which all synaptic weights are plastic and subject to a Hebbian plasticity rule, the flux rule, that can be derived from the stationarity principle of statistical learning. Moreover, the average firing rate is regulated individually via a standard homeostatic adaption of the bias of each neuron’s inputoutput nonlinear function. Additionally, networks with and without shortterm plasticity are considered. EI balance may arise only when the mean excitatory and inhibitory weights are themselves balanced, modulo the overall activity level. We show that synaptic weight balance, which has been considered hitherto as given, naturally arises in autonomous neural networks when the here considered selflimiting Hebbian synaptic plasticity rule is continuously active.
Introduction
It is well established that a balance between excitation and inhibition, usually denoted as EI balance, arises during spontaneous cortical activity, both in vitro^{1,2,3,4} and in the intact and spontaneously active cortex^{4,5,6,7}. This balance, which refers to a relatively constant ratio between excitatory and inhibitory inputs to a neuron, has been theoretically predicted as way to explain how cortical networks are able to sustain stable though temporally irregular, and even chaotic, dynamics^{8,9,10}. Since then, the ramifications of such a balanced state in terms of both dynamics and computation have been widely studied, showing how EI balance results in criticalstate dynamics of avalanches and oscillations^{11}, with direct implications for the dynamic range^{12}, storage of information^{13}, and computational power^{14} of networks.
Recurrent neural networks can use EI balance to generate asynchronous states even in the presence of strongly shared inputs^{15}. Indeed, nearby cortical neurons with similar orientation tuning show low correlated variability, potentially simplifying the decoding of information by a population of such neurons^{16}. Balanced networks have also been shown to work potentially in at least two different regimes, linking richness of the internal dynamics, connectivity strength, and functionality: a weak coupling state favoring information transmission, and a strongly coupled state, characterized by complex internal dynamics which could be employed for information processing^{17}. Modulating the ratio between excitation and inhibition it is furthermore possible to selectively switch information gating and rerouting between different circuits on and off^{18}.
The direct link between EI balance and information transmission, together with observations of an atypical ratio of excitation/inhibition in neurobehavioral syndromes such as autism, has led to the hypothesis that an abnormal degree of EI balance might be behind a series of psychiatric disorders^{19}. Indeed, later causal experimental studies in mice have shown how further elevation of EI balance, above typical physiological levels, produce a strong impairment of information processing and result in social deficits consistent with those of humans suffering from these conditions^{20}.
It has been shown that networks of supralinear excitatory and inhibitory neurons, namely of neurons whose nonlinearities are purely expansive (no saturation) and which would therefore tend to exhibit unstable behavior, can be stabilized choosing the right type of connectivity matrices, resulting in stabilized loosely balanced dynamics^{21}. These networks, denoted stabilized supralinear networks (SSN), are able to capture a wide range of experimental findings of visual cortical neurons including contextual modulation and normalization, spatial properties of intracortical connections^{22}, as well as stimulus dependence of neural variability^{23}.
Different approaches have been taken in the past to construct balanced neural networks for numerical simulations. When van Vreeswijk and Sompolinsky introduced the balanced network model^{8} they constructed the connectivity matrix using sparse random connections, where the overall connection strength was forced to be inversely proportional to the square root of the number of connections. The conditions for stability of the (SSN) have also been studied analytically^{21}, and the weights in these type of networks are typically selected so that the network is stable^{22,23}. Balance has also been a topic of study in nonchaotic networks designed for generation of complex movement. Termed “stabilityoptimized circuits” (SOCs)^{24}, in these networks balance is achieved by an optimizer modifying inhibitory connections, together with a mechanism able to prune or add new synapses.
These approaches did not however tackle the issue of how the brain could find those weight configurations. In particular, a key question is whether EI balance in the brain is the result of genetically encoded synaptic strengths or, alternatively, whether ongoing internal synaptic adaption may lead to a dynamic configuration of balanced synaptic weights. In other words, whether the distribution of synaptic weights is a priori given or the result of a selforganizing process.
Several studies in recent years have proposed the use of inhibitory synaptic plasticity (ISP) to attain balance (see^{25}, for a comprehensive review). Within those, an important step towards the understanding of how balance can be achieved in an unsupervised fashion in the brain has been the work of Vogels et al.^{26}, who have shown in simulations how single neurons with constant (or controlled) E weights and plastic I weights, receiving an external stimulus can attain balance, and also restore it after the excitatory synaptic weights are modified. Moreover the authors also show how balance can be attained in random recurrent networks where only the connections from I cells to E cells are plastic via ISP.
Both excitatory and inhibitory connections, as well as the overall excitability of neurons are plastic and constantly evolve together with the neural activity in the brain, with a variety of plasticity mechanisms operating at different timescales^{27,28,29,30,31}. The key question we address here is whether it would be possible for balance to be attained under these conditions in the brain, in a completely unsupervised way. We find here that this is indeed a plausible option. Namely, we show how simulated networks of nonlinear excitatory and inhibitory neurons, evolving autonomously under local, online intrinsic and synaptic plasticity rules, generically achieve states which are balanced both with respect to the distribution of the synaptic weights and with respect to the inputs individual neurons receive. Hence we refer to this type of networks with continuously evolving synaptic and intrinsic parameters as Selforganized Plastic Balanced Network (SOPBN). No external optimization is here employed and the procedure is shown to be robust also to the addition of external noise.
Methods
We consider autonomous ErdösRényi networks containing N neurons characterized by a linking probability p. The membrane potential x_{ i } of the rateencoding neurons obeys
where y_{ i } is the firing rate, b_{ i } the threshold and w_{ ij } are the internal synaptic weights. There is no external input. In particular, no external source of noise is present in the main analysis of the system (we show in the Supplementary Material how these results are robust to the addition of a finite amount of external noise). The membrane time constant τ is set to 10 ms for inhibitory and respectively to 20 ms for excitatory neurons.
The neural model we employ is described by a nonlinear relation between membrane potentials and firing rates and has been used in previous work^{32} to derive the Hebbian plasticity rules we will later employ. This transformation is expansive for low firing rates and saturates for very high rates. While a saturation of this type is unavoidable for any realistic biological system, cortical neurons have always been observed to behave in the low firing rate regime, where this saturation is not visible, and the transfer function is typically described by a thresholdpowerlaw \(y\propto {\lfloor x\rfloor }^{n}\) with exponent n between 1 and 5^{33,34,35}. We show however in Fig. 1 how, for low firing rates (encouraged by the intrinsic plasticity rule we employ) both functions are virtually indistinguishable.
Adaption of the synaptic weights
The recurrent synaptic weights are continuously adapted using the multiplicative selflimiting Hebbian rule^{32}
where the membrane potential x_{ i } and the activity y_{ i } of the postysynaptic neuron are related in this model via (1) by a deterministic sigmoidal transfer function. This allows us to write functions G and H as functions of x_{ i } only, where y_{ i } is then simply shorthand for y_{ i }(x_{ i }). This update rule may be derived from an information theoretical principle, the stationarity principle for statistical learning^{36}, which states that the distribution function of the postsynaptic neural activity continuously evolves during the weight adaption process, becoming stationary only once learning is completed. Being autonomous the network considered here is however not confronted with an explicit learning task. Learning denotes in our context therefore the unsupervised process of weight adaption, which minimizes in our case the the Fisher information of the activity of the postsynaptic neuron^{32}.
The limiting term G(x) in (2) changes sign when the postsynaptic activity y_{ i } is either too large or too small in comparison with x_{0}, reversing hence the Hebbian learning regulated in turn by H(x). This property of G(x) is useful for the learning rule as it prevents runaway synaptic growth, operating as an effective homeostatic synaptic plasticity mechanism, mounted on top of the Hebbian part of the rule^{37}. Our adaption rule, which is also denoted flux rule^{32}, is robust with respect to the actual value selected for the references scale x_{0} of the membrane potential, as we checked performing test runs with x_{0} = 1 and x_{0} = 8. For the simulations presented here we used x_{0} = 4.
We note that Hebbian learning rules like (2) are normally formulated not with respect to the bare presynaptic activities, but with respect to the deviation δy_{ j } = y_{ j } − 〈y_{ j }〉 of the presynaptic activity y_{ j } with respect to its timeaveraged mean〈y_{ j }〉. The adaption rule (2) performs in that case a principal component analysis for which the signaltonoise ratio increases with increasing x_{0}^{32}, being otherwise sensible to input directions y_{ j } characterized by a negative excess kurtosis.
For the study presented further below we use the same adaption rule for all synapses, namely (2), whose selflimiting behavior stabilizes firing rates, rather than trying to reproduce a particular instance of the wide variety of experimentally observed phenomenological spike time dependent synaptic plasticity (STDP) rules for inhibitory connections^{25}. This route would involve therefore the introduction of not wellconstrained parameters, transcending in addition the central aims of our investigation. We are interested here to investigate if ongoing Hebbian plasticity and balanced asynchronous dynamics are compatible.
The threshold b_{ i } = b_{ i }(t) entering the transfer function in (1) sets, as usual, the average firing rates. Here we use
for the adaption rule for the threshold, which reduces, for y ≈ y_{ t } = 0.2, to the somewhat extended expressions one may derive from homeostatic principles for neural activity^{38,39,40}. For the adaption rates we used 1/ε_{ b } = 10 and 1/ε_{ w } = 100 (in seconds).
Synaptic pruning
Dale’s law states that neurons are either excitatory or inhibitory, namely that w_{ lj }w_{ kj } ≥ 0 for all l and k. For a Hebbian plasticity rule like (2) to respect Dale’s law one needs to prune a synaptic connection whenever the respective w_{ ij } changes sign. We do this every 1000 ms of mathematical simulation time, reinserting the pruned link with a weight corresponding to 10% of the correspondingly average excitatory or inhibitory links. Performing test runs where the pruned links were reinserted with a strength of 1% of the average mean yielded nearly identical results. For the reinsertion process the postsynaptic neuron i is connected to a random and previously unconnected presynaptic neuron m, with the sign of the new link w_{ im } respecting Dale’s law. There are two possible versions.
Annealed pruning
Links may change sign when the new presynaptic neuron m is selected freely. The overall number of excitatory and inhibitory links may then drift over the course of the simulation, with only the total connectivity remaining constant.
Frozen pruning
Links do not change in character when the new presynaptic neuron m is selected only among those neurons which are of the same type as j. Frozen pruning would correspond from a biological perspective to a separate reshuffling of Gaba and Glutamate receptors.
For the results presented here we considered frozen pruning.
Shortterm synaptic plasticity
We also included shortterm plasticity (STSP), a mostly presynaptically induced modulation of the synaptic efficacy lasting hundreds of milliseconds to seconds^{41}. STSP may lead both to synaptic potentiation and depression, resulting respectively from an influx of Ca^{2+} ions into the presynaptic bulb and from a depletion of the available reservoir of neurotransmitters. These effects are captured within the TsodyksMarkram model^{42} by two variables, u(t) and φ(t), encoding respectively the presynaptic Ca^{2+}concentration and the number of vesicles with neurotransmitters. The transient plasticity rules
then describe the time evolution of the effective synaptic weight \({\tilde{w}}_{ij}\) which is proportional to the bare synaptic weight w_{ ij }, to the number of available vesicles φ_{ j } and to the vesicle’s release probability u_{ j }. In simulations where STSP is present, \({\tilde{w}}_{ij}\) replaces w_{ ij } in (1). STSP is transient in the sense that both u_{ j } and φ_{ j } relax to unity in the absence of presynaptic activity y_{ j } → 0. Typical time evolution curves for the synaptic efficiency multiplier φ_{ j }(t)u_{ j }(t) are presented in Fig. 1.
With the introduction of STSP and making an explicit distinction between E and I inputs, the driving current \({x}_{i}^{(inp)}\) defined in (1) is then generalized to
where {exc} and {inh} denote respectively the set of excitatory and inhibitory neurons. One can define analogously with
the average excitatory and inhibitory effective synaptic weights.
We note that the original TsodyksMarkram model^{42} describes STSP for the case of spiking neurons and that one can derive (4) by assuming α = β = 0.01 and that a maximal neural activity of y_{ j } → 1 corresponds to a firing rate of 40 Hz. Typical values for the time scales entering (4) are T_{ u } = 500 ms and T_{ φ } = 200 ms for excitatory synapses in the medial prefrontal cortex of ferrets^{43} and T_{ u } = 20 ms and T_{ φ } = 700 ms for inhibitory layer 2–4 neurons of the somatosensory cortex of Wistar rats^{44}. It has been pointed out, that these time scales are also relevant for behavioral control tasks^{45}.
For our simulations we used U_{ max } = 4, α = β = 0.01, T_{ u } = 500 ms and T_{ φ } = 200 ms for all synapses. We did also run control runs involving 500/200 and 20/700 T_{ u }/T_{ φ } pairs respectively for excitatory and inhibitory synapses, which led however only to minor quantitative changes.
Results
We are interested in investigating under which conditions an autonomous neural network, whose dynamics is described by (1), (2), (3) and (4), evolves towards a stable, irregular and balanced state (SOPBN). The results here presented correspond to networks of both excitatory and inhibitory neurons, where 80% of neurons are excitatory and 20% are inhibitory, and whose connections respect Dale’s principle, even when plasticity mechanisms are at play. We have taken membrane time constants of 20 and 10 ms for excitatory and inhibitory cells, respectively. As checks, we have also repeated the simulations with networks consisting of 50% excitatory and 50% inhibitory neurons and with equal membrane time constants, observing no qualitative differences. Unless otherwise stated, we will present results with a total number of neurons N = 400, a fixed 80% fraction of excitatory cells, a link probability p = 0.2 and a target average activity of y_{ t } = 0.2. The initial synaptic weights are drawn from Gaussians with means 7.5 (−30.0) and standard deviations 0.375 (1.5) for excitatory and inhibitory synapses, respectively. Our simulations were performed in all cases with a C++ code running on a standard desktop computer.
Rate encoding neurons with asynchronous activity spikes
We find that the SOPBN tends to evolve to an irregularly bursting state characterized by time scales of the order of 100–200 ms. The data presented in Fig. 2 illustrates typical two second intervals of activity, as obtained directly at initialization and after one hour of mathematical simulation time. It shows the following:

The system state is very different at the beginning and after one hour: While some neurons are constantly quiet or active directly after initialization, the network exhibits pervading bursts after evolving for one hour.

The mean excitatory \(\langle {x}_{i}^{(exc)}\rangle \) and inhibitory \(\langle {x}_{i}^{(inh)}\rangle \) inputs a neuron receives are both large in magnitude. The substantially smaller value for the overall mean input expresses EI balance. This EI balance is present for arbitrary timeframes within the systems evolution. Averaged over time we have
$$\begin{array}{c}{\langle {x}_{i}^{(exc)}\rangle }_{t=0{\rm{s}}}\approx 144.8,\,{\langle {x}_{i}^{(inh)}\rangle }_{t=0{\rm{s}}}\approx \,147.3,\,{\langle {x}_{i}^{(exc)}\rangle }_{t=0{\rm{s}}}+{\langle {x}_{i}^{(inh)}\rangle }_{t=0{\rm{s}}}\approx \,2.5,\\ {\langle {x}_{i}^{(exc)}\rangle }_{t=1{\rm{h}}}\approx 41.9,\,{\langle {x}_{i}^{(inh)}\rangle }_{t=1{\rm{h}}}\approx \,44.1,\,{\langle {x}_{i}^{(exc)}\rangle }_{t=1{\rm{h}}}+{\langle {x}_{i}^{(inh)}\rangle }_{t=1{\rm{h}}}\approx \,2.2,\end{array}$$for the system at different times where the brackets denote now averages over the network and over time.

Deviations from the average EI balance lead to large swings in the membrane potentials and hence to sharp activity spikes. A remarkable feature for rateencoding neurons evolving with (1) continuously in time.

Bursts in the late network state involve the entire network. All excitatory and inhibitory neurons are active one or more times during a burst, as we have checked. The activities of the individual neurons are however asynchronous.
We also examined the EI balance \({x}_{i}^{(exc)}+{x}_{i}^{(inh)}\) for individual neurons, obtaining results very close to the network averages shown in Fig. 2. A detailed analysis of the corresponding cross correlations is presented further below.
In^{21} the authors compare the degree of cancellation (or tightness of the balance) between the van Vreeswijk and Sompolinsky balanced networks, and the SSN, showing that while the first kind requires a very high degree of cancellation, the SSN can operate in a regime of loose balance. These networks have however constant synaptic weights and intrinsic parameters. We observe in SOPBNs, where several parameters are plastic, that while most of the time the network follows a high degree of balance (with correlations close to unity as shown in Fig. 7), this tightness is transiently broken to allow for bursts of activity.
Autonomous networks with balanced and increasingly large, but otherwise random synaptic weight distributions, are known to produce a chaotic state in the thermodynamic limit^{9}. Testing this prediction we considered the nonadapting case with ε_{ b } = ε_{ w } = 0. By additionally switching off shortterm synaptic plasticity, we find that a N = 400 network leads, depending on the initial weight distribution, either to fixpoints, limitcycles, or to states of highly irregular activity. We however did not try to determine the relative incidence rates of theses three states. The two types of irregular spiking states, which are illustrated in Fig. 3, as resulting from adapting and from nonadapting dynamics, differ with respect to activity bursts (which are observed also in Fig. 2), which are conspicuously absent in our nonadapting networks.
As a note, these irregular spiking states show signs of corresponding to a transient chaotic state (see subsection Analysis of the irregular activity, in the Supplementary Material).
Evolution of balanced synaptic weights
We present in Fig. 4 the evolution of the network averages (6) of the synaptic weights. We find that the Hebbian plasticity rule (2) renormalizes the synaptic weights while approximately retaining the balance
between the mean excitatory \({\overline{w}}^{(exc)}\) and the mean inhibitory \({\overline{w}}^{(inh)}\) weights, where we have denoted with f_{ exc }/f_{ inh } and \(\langle {y}_{i}^{(exc)}\rangle /\langle {y}_{i}^{(inh)}\rangle \) the fractions and the mean activities of excitatory and inhibitory neurons. For the present study we have \(\langle {y}_{i}^{(exc)}\rangle =\langle {y}_{i}^{(inh)}\rangle ={y}_{t}\). The second relation in (7) refers to 80/20 networks, which contain four times as many excitatory as inhibitory neurons.

The balance presented in Fig. 4 is not perfect, with the inhibitory weights being slightly dominating on the long run.

We also considered networks for which the initial weight distribution was strongly not balanced, finding that the adaption rule (2) leads to balanced mean synaptic weights. We will discuss the self organization of EI balance in more detail further below for the case of 50/50 networks.
In Fig. 5 the full distribution of synaptic weights is presented, with the results obtained from a 3600 sec simulation contrasted to the initial weight distribution. It is evident that the redistribution of synaptic weights is substantial, reaching far beyond a simple overall rescaling of the mean, as presented in Fig. 4. The excitatory weights, and to a certain extent also the inhibitory weights, tend to pile up at the pruning threshold, which has been set to zero. Trying exponential and lognormal fits we found that the excitatory weight distribution follows fairly well a lognormal distribution.
System size and simulation time effects
The comparison between networks with N = 400 and N = 3200 presented in Fig. 5 shows that the overall functional form of the weight distribution changes qualitatively for the inhibitory weights, but not for the excitatory weights. The small additional peak visible for N = 3200 for the inhibitory links corresponds to the synaptic weights of the links reinserted after pruning.
The mean weights, which are also presented in Fig. 5, scale down with increasing systems size. For the data presented in Fig. 5 the connection probability is p = 0.2 for both N = 400 and N = 3200. It is then an interesting question which kind of scaling autonomous Hebbian learning would produce. Our attempts to determine how the synaptic weights scale with respect to the mean number of afferent synapse Z = pN were however not successful. For the data presented in Fig. 5 we note that the ratio of the mean synaptic weights is about a factor two for N = 400 and N = 3200, with the corresponding ratio of Z being 1/8.
Comparing weight distributions for a fixed simulation time is not meaningful for systems, as our SOPBN, that do not stop evolving. Average weights continue to drop even for longterm simulations, as evident in part in Fig. 4. We find that the system switches to a new state (characterized either by limit cycles, fixpoints or by very long quiet periods) after extended transients, which are at least of the order of several hours. The irregular state observed, as in Fig. 3, corresponds therefore to a transient state. The transients last however orders of magnitude longer than the time scales relevant for information processing in biological networks, which range typically from milliseconds to seconds.
Selforganized balanced synaptic weights
The results presented hitherto in Figs 2, 3, 4 and 5 have been for 80/20 systems where the initial synaptic weights had been drawn from balanced distributions. Going one step further we now examine whether the Hebbian plasticity rule (2) is able to transform a nonbalanced weight distribution into a balanced distribution.
We present in Fig. 6 the evolution of the synaptic weights for a 50/50 system, for which the initial synaptic weights had been drawn from Gaussians with means 7.5 (−15.0) and standard deviations 0.375 (1.5) for excitatory and inhibitory synapses, respectively. One notices that the autonomous Hebbian learning rule (2) balances the initially unbalanced synaptic weight distribution as fast as possible, that is, on the timescale 1/ε_{ w } = 100 s. Equivalent results were obtained for initially unbalanced 80/20 systems.
The distribution of synaptic weights selforganizes, as evident from the data presented in Fig. 6, becoming fully symmetric within one hour of Hebbian adaption. The same is found for initially nonbalanced 80/20 networks (not shown), for which the final synaptic weight is also balanced, albeit nonsymmetric.
Would any Hebbian learning rule lead to balanced synaptic weights?
A range of distinct synaptic plasticity rules are Hebbian in the sense that they perform a principal component analysis (PCA) whenever a direction in the space of input activities presents a larger variance with respect to all other input directions^{32}. Examples are the flux rule (2), which may be derived from the stationarity principle for statistical learning^{36}, and Oja’s rule^{46},
In order to work with average synaptic weight changes \(\langle {\dot{w}}_{ij}\rangle \) of comparable magnitude, one needs to rescale the adaption rate ε_{oja} with respect to ε_{ w }, which enters the flux rule (2). We use ε_{oja} = 10ε_{ w }.
In Fig. 8 the time evolution of the average excitatory and inhibitory synaptic weights, as produced by Oja’s rule (8), are presented. Oja’s rule leads to a complete rescaling of the inhibitory weights and hence to a maximally unbalanced synaptic weight distribution, which is furthermore characterized by intermittent periods of abrupt changes.
Synaptic weight growth is limited by both Oja’s and by the flux rule, namely as a consequence of the additive damping factor for the case of Oja’s rule (8) and as the result of the multiplicative limiting factor G(x) = x_{0} + x(1 − 2y) for the case of the flux rule (2). For comparison we performed simulations where we replaced G(x) in (2) by a constant. We find in this case that the excitatory weights are rescaled to zero. The synaptic weight distribution is therefore also maximally unbalanced. The runaway growth of the inhibitory synaptic weights showing up in Fig. 8, which is due to the removal of the limiting factor G(x) in (2), is accompanied by a respective evolution of the threshold, via (3), such that the average activity remains close to y_{ t } = 0.2.
The flux rule (2) is manifestly only a function of the membrane potential x_{ i } and of the effective presynaptic activity φ_{ j }u_{ j }y_{ j }, which is in turn positive. The overall functional form follows closely that of a cubic polynomial^{36},
where the x^{±} denote the roots of G(x) = x_{0} + x(1 − 2y). Stationarity is achieved when the time average of (9) vanishes, that is when the average membrane potential 〈x_{ i }〉 is on the order of the size of the roots x^{±} and b/2 of G(x)H(x).

The threshold b, which is determined via the sigmoidal (1) by the target activity y_{ t }, is of order unity whenever this is the case for the average membrane potential 〈x_{ i }〉.

It is viceversa true, that the average membrane potential 〈x_{ i }〉 will be of order unity, as long as this is the case for the roots x^{±} and b/2 of G(x)H(x).
These two conditions are mutually compatible. It is from this point not surprising that the flux rule leads on average to small membrane potentials, as evident in Fig. 3, and consequently also to approximately balanced synaptic weight distributions. We note in contrast that Oja’s rule (8) is explicitly dependent in addition on the weight w_{ ij } of the adapting synapse.
We conclude that not every Hebbian learning rule will produce balanced irregular dynamics. While we have pointed out here at some differences between the Flux rule and Oja’s rule, which may hint at the conditions for a rule to achieve this state, further work is necessary to determine which families of rules can and cannot perform this task.
EI balance in terms of EI correlations
To now quantify the degree of balance between excitation and inhibition, we compute for a given neuron the crosscorrelation \({C}_{i}^{\pm }\) between the total excitatory incoming synaptic current \({x}_{i}^{(exc)}\), as defined by (5), and the total inhibitory synaptic current \({x}_{i}^{(inh)}\), averaged first with respect to time and then across all neurons of the network:
Here we have denoted with \({\sigma }_{i}^{(exc)}\) and \({\sigma }_{i}^{(inh)}\) the standard deviations of \({x}_{i}^{(exc)}\) and respectively of \({x}_{i}^{(inh)}\).
In Fig. 7 we present the cross correlation ρ^{±} for the 50/50 system discussed in Fig. 6, for which the initial weight configurations are not balanced. Note that the time scale for Hebbian learning is 1/ε_{ w } = 100 sec, which is a order of magnitude larger than the interval of 10 sec used for evaluating ρ^{±} via (10). Analogous investigations for an 80/20 system can be found in the Supplementary Material in Fig. S3.
The cross correlation characterizing the EI balance of the initial state is only marginally dependent on whether shortterm and/or intrinsic plasticity are active. Its surprisingly large overall value, about (45–50)%, reflects the presence of substantial interneuronal activity correlations, which we did not investigate further. Comparing with the data presented in Fig. 6 one notices that ρ^{±} is a somewhat less sensible yardstick for EI balance than the bare synaptic weight balance, which renormalizes to small values in a balanced state. The data shown in Fig. 7 confirms otherwise that the Hebbian plasticity rule (2) leads to a highly balanced state.
We have so far considered here networks without any external noise, which would not be the case in the brain. A state characterized by irregular neural activity is generically expected to be robust against moderate noise levels. Performing simulations with additive input noise, characterized by zero means and a standard deviation of (5–10)%, with respect to the mean of the bare input, we found this expectation to hold. The cross correlation ρ^{±} barely changes as long as the level of noise present remains moderate. The situation changes gradually with increasing noise strength, with EI balance breaking down when the noise level reaches about 50% of the bare input strength (cf. Fig. S1 in the Supplementary Material).
Discussion
We have examined here the question of whether it would be plausible for a neural network in which both intrinsic and synaptic (E as well as I connections) parameters are continuously evolving to achieve balance both in terms of weights and activities, in a fully unsupervised way, finding that this is indeed possible. The resulting balanced network (which we have denoted here SOPBN) arises in a selforganized fashion, in analogy to the critical state characterizing possibly certain aspects of cortical dynamics^{47}. We studied for this purpose the influence of continuously ongoing Hebbian plasticity within autonomous networks of rateencoding neurons, finding that the synaptic plasticity rule that follows from the stationarity principle of statistical learning, the flux rule, does indeed induce a balanced synaptic weight distribution, even when the initial distribution is strongly unbalanced.
EI balance induced by Hebbian learning
Comparing the flux rule with and without the selflimiting term and Oja’s rule, we have found that Hebbian learning leads to a balanced distribution of synaptic weights, and hence also to a balanced state, whenever the learning rule favors small average membrane potentials. It is not necessary, for this to happen, that the learning rule constrains the overall input to strictly vanish on average, it suffices that the time averaged input remains of the order of the neural parameters, such as the inverse slope of the transfer function in (1). We found that the flux rule, as defined by (2) and (9), fulfills this requirement. An example of a Hebbian rule not leading to a balanced weight distribution is on the other side given by Oja’s rule (8).
Rate encoding neurons showing spikelike neural activity
An EI balanced state is characterized in addition to the small average membrane potential by the near cancellation of two large drivings in the form of large excitatory and inhibitory inputs. Such a state is highly sensible to small imbalances resulting either from additional external signals or from internal fluctuations. We find these imbalances to be strong enough in SOPBNs to induce short spikelike bursts in the neural activity, as observed e.g. in Fig. 2. This is quite remarkable, as one could have expected that the rateencoding neurons used for the present study would be more likely to lead to slowly and hence to smoothly varying dynamical states.
Asynchronous neural activity
The near cancellation of large excitatory and inhibitory drivings stabilizes asynchronous neural activity, as illustrated in Fig. 3 in terms of the membrane potential. Using the 0–1 test for chaos^{48} we found the asynchronous state in SOPBNs to be at least strongly irregular (cf. Fig. S2 in the Supplementary Material). As indicators for chaos one may have analyzed the time intervals between activity spikes^{49} or the Lyapunov exponents of the system. The observation that the synaptic weight distribution changes continuously, as demonstrated in Fig. 6, over time scales of hours, proves in any case that the neural activity is irregular on extended times scales. The limit of infinitely long times is not the focus of this study, as real neural systems are not expected to function for prolonged periods in the absence of stimuli.
Absence of a stationary autonomous state
We find, as shown in Fig. 4, that the size of the mean synaptic weights decays slowly but continuously. Experimenting with different ensembles of initial weight statistics we found no instance where Hebbian learning retaining EI balance would lead to a systematic increase in magnitude of the overall mean synaptic weights. We note, however, that this observation holds only for the here considered case of isolated networks, hence without an additional external driving. An adaption rate ε_{ w } that would fade out slowing, being only initially large, would also preempt the long term decay of average synaptic weights.
Theory vs. experiment
The dynamic balance of excitation and inhibition is observed experimentally within a range of distinct settings^{1,5}. Multielectrode recordings in human and monkey neocortex suggests that EI balance is caused in essence by local recurrent activity^{50}, and not by external inputs, with irregular bursting activity showing up on a range of time scales that starts, as for SOPBNs, at a few hundred milliseconds. It is also interesting that the independent adjustment of synapses connecting inhibitory to layer 2/3 pyramidal neurons in the mouse primary visual cortex has been found to be key for EI balance to occur on a singleneuron level^{51}. These findings concur with the results for the single neuron cross correlation presented in Fig. 7, for which the network average has been performed only as a second step. Furthermore we note that both the self organized bursting states observed in SOPBNs, see Fig. 6, and the alternating up and down states observed for in vitro prefrontal and occipital ferret slices are characterized by the asynchronous participation of all neurons^{2}.
Outlook
Which configuration of synaptic weights results from continuously ongoing internal Hebbian learning? We presented here a first inroad into this subject, focusing in particular on the selforganized emergence of EI balance in terms of large but nearly canceling excitatory and inhibitory inputs. We find that not all selflimiting Hebbian plasticity rules are able to do the job. There is on the other hand no need for a Hebbian learning rule to enforce EI balance explicitly. We find that EI balance already emerges when the Hebbian learning rule favors membrane potentials which are small with respect to the variance of the inputs, being nevertheless large enough to be relevant for the neural transfer function.
References
 1.
SanchezVives, M. V. & McCormick, D. A. Cellular and network mechanisms of rhythmic recurrent activity in neocortex. Nature neuroscience 3, 1027 (2000).
 2.
Shu, Y., Hasenstaub, A. & McCormick, D. A. Turning on and off recurrent balanced cortical activity. Nature 423, 288–293 (2003).
 3.
Tahvildari, B., Wölfel, M., Duque, A. & McCormick, D. A. Selective functional interactions between excitatory and inhibitory cortical neurons and differential contribution to persistent activity of the slow oscillation. Journal of Neuroscience 32, 12165–12179 (2012).
 4.
Atallah, B. V. & Scanziani, M. Instantaneous modulation of gamma oscillation frequency by balancing excitation with inhibition. Neuron 62, 566–577 (2009).
 5.
Haider, B., Duque, A., Hasenstaub, A. R. & McCormick, D. A. Neocortical network activity in vivo is generated through a dynamic balance of excitation and inhibition. Journal of Neuroscience 26, 4535–4545 (2006).
 6.
Okun, M. & Lampl, I. Instantaneous correlation of excitation and inhibition during ongoing and sensoryevoked activities. Nature neuroscience 11, 535–537 (2008).
 7.
Taub, A. H., Katz, Y. & Lampl, I. Cortical balance of excitation and inhibition is regulated by the rate of synaptic activity. Journal of Neuroscience 33, 14359–14368 (2013).
 8.
Van Vreeswijk, C. & Sompolinsky, H. Chaos in neuronal networks with balanced excitatory and inhibitory activity. Science 274, 1724–1726 (1996).
 9.
Van Vreeswijk, C. & Sompolinsky, H. Chaotic balanced state in a model of cortical circuits. Neural computation 10, 1321–1371 (1998).
 10.
Shadlen, M. N. & Newsome, W. T. The variable discharge of cortical neurons: implications for connectivity, computation, and information coding. Journal of neuroscience 18, 3870–3896 (1998).
 11.
Poil, S.S., Hardstone, R., Mansvelder, H. D. & LinkenkaerHansen, K. Criticalstate dynamics of avalanches and oscillations jointly emerge from balanced excitation/inhibition in neuronal networks. Journal of Neuroscience 32, 9817–9823 (2012).
 12.
Haldeman, C. & Beggs, J. M. Critical branching captures activity in living neural networks and maximizes the number of metastable states. Physical review letters 94, 058101 (2005).
 13.
Beggs, J. M. & Plenz, D. Neuronal avalanches are diverse and precise activity patterns that are stable for many hours in cortical slice cultures. Journal of neuroscience 24, 5216–5229 (2004).
 14.
Legenstein, R. & Maass, W. Edge of chaos and prediction of computational performance for neural circuit models. Neural Networks 20, 323–334 (2007).
 15.
Renart, A. et al. The asynchronous state in cortical circuits. Science 327, 587–590 (2010).
 16.
Ecker, A. S. et al. Decorrelated neuronal firing in cortical microcircuits. Science 327, 584–587 (2010).
 17.
Ostojic, S. Two types of asynchronous activity in networks of excitatory and inhibitory spiking neurons. Nature neuroscience 17, 594–600 (2014).
 18.
Vogels, T. P. & Abbott, L. Gating multiple signals through detailed balance of excitation and inhibition in spiking networks. Nature neuroscience 12, 483–491 (2009).
 19.
Rubenstein, J. & Merzenich, M. M. Model of autism: increased ratio of excitation/inhibition in key neural systems. Genes, Brain and Behavior 2, 255–267 (2003).
 20.
Yizhar, O. et al. Neocortical excitation/inhibition balance in information processing and social dysfunction. Nature 477, 171–178 (2011).
 21.
Ahmadian, Y., Rubin, D. B. & Miller, K. D. Analysis of the stabilized supralinear network. Neural computation 25, 1994–2037 (2013).
 22.
Rubin, D. B., Van Hooser, S. D. & Miller, K. D. The stabilized supralinear network: a unifying circuit motif underlying multiinput integration in sensory cortex. Neuron 85, 402–417 (2015).
 23.
Hennequin, G., Ahmadian, Y., Rubin, D. B., Lengyel, M. & Miller, K. D. Stabilized supralinear network dynamics account for stimulusinduced changes of noise variability in the cortex. bioRxiv 094334 (2016).
 24.
Hennequin, G., Vogels, T. P. & Gerstner, W. Optimal control of transient dynamics in balanced networks supports generation of complex movements. Neuron 82, 1394–1406 (2014).
 25.
Hennequin, G., Agnes, E. J. & Vogels, T. P. Inhibitory plasticity: Balance, control, and codependence. Annual Review of Neuroscience 40, 557–579 (2017).
 26.
Vogels, T. P., Sprekeler, H., Zenke, F., Clopath, C. & Gerstner, W. Inhibitory plasticity balances excitation and inhibition in sensory pathways and memory networks. Science 334, 1569–1573 (2011).
 27.
Bliss, T. V. & Lømo, T. Longlasting potentiation of synaptic transmission in the dentate area of the anaesthetized rabbit following stimulation of the perforant path. The Journal of physiology 232, 331–356 (1973).
 28.
Alger, B. & Teyler, T. Longterm and shortterm plasticity in the ca1, ca3, and dentate regions of the rat hippocampal slice. Brain research 110, 463–480 (1976).
 29.
Bi, G.Q. & Poo, M.M. Synaptic modifications in cultured hippocampal neurons: dependence on spike timing, synaptic strength, and postsynaptic cell type. Journal of neuroscience 18, 10464–10472 (1998).
 30.
Desai, N. S., Rutherford, L. C. & Turrigiano, G. G. Plasticity in the intrinsic excitability of cortical pyramidal neurons. Nature Neuroscience 2, 515–520 (1999).
 31.
Castillo, P. E., Chiu, C. Q. & Carroll, R. C. Longterm plasticity at inhibitory synapses. Current opinion in neurobiology 21, 328–338 (2011).
 32.
Echeveste, R. & Gros, C. Generating functionals for computational intelligence: The fisher information as an objective function for selflimiting hebbian learning rules. Frontiers in Robotics and AI 1, 1 (2014).
 33.
Priebe, N. J., Mechler, F., Carandini, M. & Ferster, D. The contribution of spike threshold to the dichotomy of cortical simple and complex cells. Nature neuroscience 7, 1113–1122 (2004).
 34.
Priebe, N. J. & Ferster, D. Inhibition, spike threshold, and stimulus selectivity in primary visual cortex. Neuron 57, 482–497 (2008).
 35.
Miller, K. D. & Troyer, T. W. Neural noise can explain expansive, powerlaw nonlinearities in neural response functions. Journal of neurophysiology 87, 653–659 (2002).
 36.
Echeveste, R., Eckmann, S. & Gros, C. The fisher information as a neural guiding principle for independent component analysis. Entropy 17, 3838–3856 (2015).
 37.
Turrigiano, G. G. & Nelson, S. B. Hebb and homeostasis in neuronal plasticity. Current opinion in neurobiology 10, 358–364 (2000).
 38.
Triesch, J. Synergies between intrinsic and synaptic plasticity in individual model neurons. In Advances in neural information processing systems, 1417–1424 (2005).
 39.
Markovic, D. & Gros, C. Selforganized chaos through polyhomeostatic optimization. Physical Review Letters 105, 068702 (2010).
 40.
Marković, D. & Gros, C. Intrinsic adaptation in autonomous recurrent neural networks. Neural Computation 24, 523–540 (2012).
 41.
Regehr, W. G. Shortterm presynaptic plasticity. Cold Spring Harbor perspectives in biology 4, a005702 (2012).
 42.
Mongillo, G., Barak, O. & Tsodyks, M. Synaptic theory of working memory. Science 319, 1543–1546 (2008).
 43.
Wang, Y. et al. Heterogeneity in the pyramidal network of the medial prefrontal cortex. Nature neuroscience 9, 534–542 (2006).
 44.
Gupta, A., Wang, Y. & Markram, H. Organizing principles for a diversity of gabaergic interneurons and synapses in the neocortex. Science 287, 273–278 (2000).
 45.
Martin, L., Sándor, B. & Gros, C. Closedloop robots driven by shortterm synaptic plasticity: Emergent explorative vs. limitcycle locomotion. Frontiers in neurorobotics 10, 12 (2016).
 46.
Oja, E. The nonlinear pca learning rule in independent component analysis. Neurocomputing 17, 25–45 (1997).
 47.
Marković, D. & Gros, C. Power laws and selforganized criticality in theory and nature. Physics Reports 536, 41–74 (2014).
 48.
Wernecke, H., Sándor, B. & Gros, C. How to test for partially predictable chaos. Scientific Reports 7, 1087 (2017).
 49.
Echeveste, R. & Gros, C. Drifting states and synchronization induced chaos in autonomous networks of excitable neurons. Frontiers in computational neuroscience 10, 98 (2016).
 50.
Dehghani, N. et al. Dynamic balance of excitation and inhibition in human and monkey neocortex. Scientific reports 6, 23176 (2016).
 51.
Xue, M., Atallah, B. V. & Scanziani, M. Equalizing excitation–inhibition ratios across visual cortical neurons. Nature 511, 596 (2014).
Acknowledgements
We thank Fabian Schubert for discussions.
Author information
Affiliations
Contributions
P.T. performed the simulations, R.E. contributed to the discussions, with the manuscript being written by C.G.
Corresponding author
Ethics declarations
Competing Interests
The authors declare no competing interests.
Additional information
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Trapp, P., Echeveste, R. & Gros, C. EI balance emerges naturally from continuous Hebbian learning in autonomous neural networks. Sci Rep 8, 8939 (2018). https://doi.org/10.1038/s41598018270995
Received:
Accepted:
Published:
Further reading

Evolutionary conservation and divergence of the human brain transcriptome
Genome Biology (2021)

Local Homeostatic Regulation of the Spectral Radius of EchoState Networks
Frontiers in Computational Neuroscience (2021)

A devil’s advocate view on ‘selforganized’ brain criticality
Journal of Physics: Complexity (2021)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.