Literature DB >> 9300423

Reinforcement learning by Hebbian synapses with adaptive thresholds.

C M Pennartz1.   

Abstract

A central problem in learning theory is how the vertebrate brain processes reinforcing stimuli in order to master complex sensorimotor tasks. This problem belongs to the domain of supervised learning, in which errors in the response of a neural network serve as the basis for modification of synaptic connectivity in the network and thereby train it on a computational task. The model presented here shows how a reinforcing feedback can modify synapses in a neuronal network according to the principles of Hebbian learning. The reinforcing feedback steers synapses towards long-term potentiation or depression by critically influencing the rise in postsynaptic calcium, in accordance with findings on synaptic plasticity in mammalian brain. An important feature of the model is the dependence of modification thresholds on the previous history of reinforcing feedback processed by the network. The learning algorithm trained networks successfully on a task in which a population vector in the motor output was required to match a sensory stimulus vector presented shortly before. In another task, networks were trained to compute coordinate transformations by combining different visual inputs. The model continued to behave well when simplified units were replaced by single-compartment neurons equipped with several conductances and operating in continuous time. This novel form of reinforcement learning incorporates essential properties of Hebbian synaptic plasticity and thereby shows that supervised learning can be accomplished by a learning rule similar to those used in physiologically plausible models of unsupervised learning. The model can be crudely correlated to the anatomy and electrophysiology of the amygdala, prefrontal and cingulate cortex and has predictive implications for further experiments on synaptic plasticity and learning processes mediated by these areas.

Entities:  

Mesh:

Year:  1997        PMID: 9300423     DOI: 10.1016/s0306-4522(97)00118-8

Source DB:  PubMed          Journal:  Neuroscience        ISSN: 0306-4522            Impact factor:   3.590


  10 in total

1.  Flexible neural representations of value in the primate brain.

Authors:  C Daniel Salzman; Joseph J Paton; Marina A Belova; Sara E Morrison
Journal:  Ann N Y Acad Sci       Date:  2007-09-13       Impact factor: 5.691

2.  Neural coding of reward magnitude in the orbitofrontal cortex of the rat during a five-odor olfactory discrimination task.

Authors:  Esther van Duuren; Francisco A Nieto Escámez; Ruud N J M A Joosten; Rein Visser; Antonius B Mulder; Cyriel M A Pennartz
Journal:  Learn Mem       Date:  2007-06-11       Impact factor: 2.460

3.  Theta-band phase locking of orbitofrontal neurons during reward expectancy.

Authors:  Marijn van Wingerden; Martin Vinck; Jan Lankelma; Cyriel M A Pennartz
Journal:  J Neurosci       Date:  2010-05-19       Impact factor: 6.167

4.  A reinforcement learning framework for spiking networks with dynamic synapses.

Authors:  Karim El-Laithy; Martin Bogdan
Journal:  Comput Intell Neurosci       Date:  2011-10-23

5.  A confidence metric for using neurobiological feedback in actor-critic reinforcement learning based brain-machine interfaces.

Authors:  Noeline W Prins; Justin C Sanchez; Abhishek Prasad
Journal:  Front Neurosci       Date:  2014-05-26       Impact factor: 4.677

6.  Reversal Learning in Humans and Gerbils: Dynamic Control Network Facilitates Learning.

Authors:  Christian Jarvers; Tobias Brosch; André Brechmann; Marie L Woldeit; Andreas L Schulz; Frank W Ohl; Marcel Lommerzheim; Heiko Neumann
Journal:  Front Neurosci       Date:  2016-11-17       Impact factor: 4.677

Review 7.  Learning, memory and consolidation mechanisms for behavioral control in hierarchically organized cortico-basal ganglia systems.

Authors:  Silviu I Rusu; Cyriel M A Pennartz
Journal:  Hippocampus       Date:  2019-10-16       Impact factor: 3.899

Review 8.  The Dopamine System and Automatization of Movement Sequences: A Review With Relevance for Speech and Stuttering.

Authors:  Per A Alm
Journal:  Front Hum Neurosci       Date:  2021-12-02       Impact factor: 3.169

Review 9.  Reconciling the object and spatial processing views of the perirhinal cortex through task-relevant unitization.

Authors:  Julien Fiorilli; Jeroen J Bos; Xenia Grande; Judith Lim; Emrah Düzel; Cyriel M A Pennartz
Journal:  Hippocampus       Date:  2021-02-01       Impact factor: 3.899

10.  Conditioning sharpens the spatial representation of rewarded stimuli in mouse primary visual cortex.

Authors:  Pieter M Goltstein; Guido T Meijer; Cyriel Ma Pennartz
Journal:  Elife       Date:  2018-09-17       Impact factor: 8.140

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.