Can neocortical feedback alter the sign of plasticity?

Richards, Blake A.; Lillicrap, Timothy P.

doi:10.1038/s41583-018-0049-5

Correspondence
Published: 14 August 2018

Nature Reviews Neuroscience volume 19, page 636 (2018)

The recent Review by Roelfsema and Holtmaat (Control of synaptic plasticity in deep cortical networks. Nat. Rev. Neurosci. 19, 166–180 (2018))¹ provides a much-needed guide to learning in deep cortical networks. The importance of credit assignment for deep cortical networks has come into focus recently with the success of deep learning in artificial intelligence². The learning algorithm that is typically used for credit assignment in deep artificial neural networks, the backpropagation-of-error algorithm³, is biologically infeasible⁴. Yet, its successful application to complicated tasks suggests that credit assignment is important for learning in non-trivial circumstances, whether in artificial or biological neural networks.

As Roelfsema and Holtmaat note, the focus in neuroscience to date has been either on Hebbian plasticity mechanisms⁵, or three-factor Hebbian plasticity rules that incorporate a global reward prediction error⁶. They argue that a more powerful approach is to use feedback signals that can determine credit on a neuron-by-neuron basis. We agree with the authors that this is a likely role for feedback connections in learning. A question that is left open, though, is whether feedback signals act purely as a gating mechanism and, if so, whether that is enough to solve the credit assignment problem.

According to one framework that Roelfsema and Holtmaat explore in their paper, changes (Δ) in the strength of a synapse between presynaptic neuron i and postsynaptic neuron j (w_ij) are guided by a four-term equation (which we simplify here slightly):

$$\Delta w_{ij} = f_i \cdot f_j \cdot \mathrm{RPE} \cdot \mathrm{FB}_j \tag{1}$$

where f_i and f_j are functions of presynaptic and postsynaptic activity, respectively, RPE is a global reward prediction error communicated via neuromodulators and FB_j is the feedback received by the postsynaptic neuron. In the Review, Roelfsema and Holtmaat suggest that the feedback signal, FB_j, could be a gating signal ranging from 0 to 1, such that it can turn synaptic plasticity in a neuron on or off but cannot alter the sign of synaptic plasticity (for example, whether synapses potentiate or depress). Instead, the term RPE determines the sign of plasticity. However, in our opinion it may be important for FB_j to determine whether neurons potentiate or depress their synapses.
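The distinction between a gating and a signed feedback term can be made concrete with a minimal numerical sketch of equation 1. The function name `delta_w`, the example values and the assumption that f_i and f_j are non-negative (for example, firing rates) are ours for illustration and are not taken from the Review.

```python
import numpy as np

def delta_w(f_pre, f_post, rpe, fb_post):
    """Equation 1: dw[i, j] = f_i * f_j * RPE * FB_j.

    Rows index presynaptic neurons i, columns index postsynaptic neurons j.
    """
    return np.outer(f_pre, f_post * fb_post) * rpe

# Illustrative, non-negative activity terms (an assumption, e.g. firing rates).
f_pre = np.array([0.5, 1.0, 0.2])   # three presynaptic neurons
f_post = np.array([0.8, 0.3])       # two postsynaptic neurons

# Gating feedback in [0, 1]: neuron 0 is plastic, neuron 1 is switched off.
fb_gate = np.array([1.0, 0.0])
# Hypothetical signed feedback: neuron 0 potentiates, neuron 1 depresses.
fb_signed = np.array([1.0, -1.0])

dw_gate_pos = delta_w(f_pre, f_post, rpe=+1.0, fb_post=fb_gate)
dw_gate_neg = delta_w(f_pre, f_post, rpe=-1.0, fb_post=fb_gate)
dw_signed = delta_w(f_pre, f_post, rpe=+1.0, fb_post=fb_signed)

print("gating, RPE > 0:\n", dw_gate_pos)
print("gating, RPE < 0:\n", dw_gate_neg)
print("signed, RPE > 0:\n", dw_signed)
```

Under these assumptions, gating feedback leaves every plastic synapse with the same sign as the global RPE, whereas signed feedback allows different postsynaptic neurons to change in opposite directions on the same trial.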

Roelfsema and Holtmaat state that the weight changes from equation 1 are equivalent to those prescribed by backpropagation-of-error, but the equivalence holds only for the weight changes on average, and this point is crucial. Notably, even random search algorithms, such as REINFORCE⁷, agree with backpropagation-of-error on average. This means that for an individual stimulus, algorithms like REINFORCE or those prescribed by equation 1 do not follow the true gradient. Instead, these algorithms only follow the true gradient when their synaptic weight updates are averaged across many repetitions of the same stimulus. Put another way, equation 1 uses an estimator of the true gradient that backpropagation-of-error follows. There are two key questions pertinent to this approach. First, what is the variance of this estimator? Second, how long does it take to reach good performance for a given task⁸? We speculate that if a task requires learning a high-dimensional, complex function, the variance of the estimator will be high and it will take an intractably long time to reach reasonable levels of performance. For example, to the best of our knowledge, there are no examples in the literature of successfully training a good ImageNet classifier using REINFORCE-like algorithms. Algorithms like AGREL⁹, which uses feedback-based gating, can have better variance properties than REINFORCE, but whether the variance in the estimator is small enough to learn high-dimensional, complex tasks in a reasonable amount of time remains to be determined.
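The gap between agreeing with the gradient on average and following it on each trial can be illustrated with a small simulation. The sketch below is not REINFORCE or AGREL themselves; it uses node perturbation, a related random-search estimator, on a single linear layer with a squared-error loss. The network sizes, noise scale and variable names are all assumptions chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
n_in, n_out, n_samples, sigma = 20, 10, 5000, 0.01  # illustrative sizes

x = rng.normal(size=n_in)                          # one fixed stimulus
W = rng.normal(size=(n_out, n_in)) / np.sqrt(n_in)
target = rng.normal(size=n_out)

def loss(y):
    """Squared-error loss; works for one output vector or a batch of them."""
    return 0.5 * np.sum((y - target) ** 2, axis=-1)

y = W @ x
true_grad = np.outer(y - target, x)                # exact dL/dW for this stimulus

# Node perturbation: add noise to the outputs and correlate the resulting
# change in loss with the noise. Each row of xi is one repetition of the stimulus.
xi = sigma * rng.normal(size=(n_samples, n_out))
delta_loss = loss(y + xi) - loss(y)                # shape (n_samples,)
grad_y = (delta_loss / sigma**2)[:, None] * xi     # per-trial estimate of dL/dy
grad_samples = grad_y[:, :, None] * x[None, None, :]  # per-trial estimate of dL/dW

norm_true = np.linalg.norm(true_grad)
per_trial_err = np.linalg.norm(grad_samples - true_grad, axis=(1, 2)) / norm_true
avg_err = np.linalg.norm(grad_samples.mean(axis=0) - true_grad) / norm_true

print(f"mean relative error of single-trial estimates: {per_trial_err.mean():.2f}")
print(f"relative error of the {n_samples}-trial average: {avg_err:.3f}")
```

In this toy setting the averaged estimate approaches the true gradient, while individual trials deviate from it by more than the size of the gradient itself. How this variance scales in large, deep networks is the open question raised above; the sketch does not address it.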

Given these considerations, we propose that neuroscientists should consider how feedback in the neocortex may do more than act as a gating mechanism. In other words, we postulate that neocortical feedback may be set up to communicate signed credit signals that cause some neurons to potentiate and others to depress. One possibility is to use the temporal order of feedback onto specific dendrites as a signal of sign¹⁰,¹¹. Another possibility is to use inhibitory interneuron circuits to calculate a difference¹². Ultimately, we believe that neuroscientists should not assume that feedback acts only as a gating mechanism. Importantly, we are not arguing that feedback never acts as a gating signal. Indeed, recent evidence from the Holtmaat group shows feedback-based gating of plasticity¹³, although this does not preclude signed credit assignment. Prejudging that possibility could lead our investigations on this important topic astray.

There is a reply to this letter by Roelfsema, P. R. & Holtmaat, A. Nat. Rev. Neurosci. https://doi.org/10.1038/s41583-018-0048-6 (2018).

References

1. Roelfsema, P. R. & Holtmaat, A. Control of synaptic plasticity in deep cortical networks. Nat. Rev. Neurosci. 19, 166–180 (2018).
2. LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
3. Rumelhart, D. E., Hinton, G. E. & Williams, R. J. Learning representations by back-propagating errors. Nature 323, 533–536 (1986).
4. Crick, F. The recent excitement about neural networks. Nature 337, 129–132 (1989).
5. Bi, G. & Poo, M. Synaptic modification by correlated activity: Hebb’s postulate revisited. Annu. Rev. Neurosci. 24, 139–166 (2001).
6. Frémaux, N. & Gerstner, W. Neuromodulated spike-timing-dependent plasticity, and theory of three-factor learning rules. Front. Neural Circuits 9, 85 (2016).
7. Williams, R. J. Proc. IEEE Int. Conf. Neural Networks 1, 263–270 (San Diego, CA., 1988).
8. Werfel, J., Xie, X. & Seung, H. S. Learning curves for stochastic gradient descent in linear feedforward networks. Neural Comput. 17, 2699–2718 (2005).
9. Roelfsema, P. R. & van Ooyen, A. Attention-gated reinforcement learning of internal representations for classification. Neural Comput. 17, 2176–2214 (2005).
10. Guerguiev, J., Lillicrap, T. P. & Richards, B. A. Towards deep learning with segregated dendrites. eLife 6, e22901 (2017).
11. Bono, J. & Clopath, C. Modeling somatic and dendritic spike mediated plasticity at the single neuron and network level. Nat. Commun. 8, 706 (2017).
12. Sacramento, J., Costa, R. P., Bengio, Y. & Senn, W. Dendritic error backpropagation in deep cortical microcircuits. ArXiv 1801.00062 (2017).
13. Williams, L. E. & Holtmaat, A. Higher-order thalamocortical inputs gate synaptic long-term potentiation via disinhibition. bioRxiv https://doi.org/10.1101/281477 (2018).

Author information

Authors and Affiliations

Department of Biological Sciences, University of Toronto Scarborough, Toronto, Ontario, Canada
Blake A. Richards
Learning in Machines and Brains Program, Canadian Institute for Advanced Research, Toronto, Ontario, Canada
Blake A. Richards
DeepMind Inc., London, UK
Timothy P. Lillicrap

Corresponding author

Correspondence to Blake A. Richards.

Ethics declarations

Competing interests

The authors declare no competing interests.

About this article

Cite this article

Richards, B.A., Lillicrap, T.P. Can neocortical feedback alter the sign of plasticity?. Nat Rev Neurosci 19, 636 (2018). https://doi.org/10.1038/s41583-018-0049-5

Published: 14 August 2018
Issue Date: October 2018
DOI: https://doi.org/10.1038/s41583-018-0049-5

