Literature DB >> 20926659

Functional requirements for reward-modulated spike-timing-dependent plasticity.

Nicolas Frémaux1, Henning Sprekeler, Wulfram Gerstner.   

Abstract

Recent experiments have shown that spike-timing-dependent plasticity is influenced by neuromodulation. We derive theoretical conditions for successful learning of reward-related behavior for a large class of learning rules where Hebbian synaptic plasticity is conditioned on a global modulatory factor signaling reward. We show that all learning rules in this class can be separated into a term that captures the covariance of neuronal firing and reward and a second term that presents the influence of unsupervised learning. The unsupervised term, which is, in general, detrimental for reward-based learning, can be suppressed if the neuromodulatory signal encodes the difference between the reward and the expected reward-but only if the expected reward is calculated for each task and stimulus separately. If several tasks are to be learned simultaneously, the nervous system needs an internal critic that is able to predict the expected reward for arbitrary stimuli. We show that, with a critic, reward-modulated spike-timing-dependent plasticity is capable of learning motor trajectories with a temporal resolution of tens of milliseconds. The relation to temporal difference learning, the relevance of block-based learning paradigms, and the limitations of learning with a critic are discussed.

Mesh:

Year:  2010        PMID: 20926659      PMCID: PMC6634722          DOI: 10.1523/JNEUROSCI.6249-09.2010

Source DB:  PubMed          Journal:  J Neurosci        ISSN: 0270-6474            Impact factor:   6.167


  43 in total

1.  Central Cholinergic Neurons Are Rapidly Recruited by Reinforcement Feedback.

Authors:  Balázs Hangya; Sachin P Ranade; Maja Lorenc; Adam Kepecs
Journal:  Cell       Date:  2015-08-27       Impact factor: 41.582

2.  Striatal action-value neurons reconsidered.

Authors:  Lotem Elber-Dorozko; Yonatan Loewenstein
Journal:  Elife       Date:  2018-05-31       Impact factor: 8.140

3.  Evidence for a causal inverse model in an avian cortico-basal ganglia circuit.

Authors:  Nicolas Giret; Joergen Kornfeld; Surya Ganguli; Richard H R Hahnloser
Journal:  Proc Natl Acad Sci U S A       Date:  2014-04-07       Impact factor: 11.205

4.  Spike-based decision learning of Nash equilibria in two-player games.

Authors:  Johannes Friedrich; Walter Senn
Journal:  PLoS Comput Biol       Date:  2012-09-27       Impact factor: 4.475

5.  Distinct Eligibility Traces for LTP and LTD in Cortical Synapses.

Authors:  Kaiwen He; Marco Huertas; Su Z Hong; XiaoXiu Tie; Johannes W Hell; Harel Shouval; Alfredo Kirkwood
Journal:  Neuron       Date:  2015-10-22       Impact factor: 17.173

6.  Reinforcement learning using a continuous time actor-critic framework with spiking neurons.

Authors:  Nicolas Frémaux; Henning Sprekeler; Wulfram Gerstner
Journal:  PLoS Comput Biol       Date:  2013-04-11       Impact factor: 4.475

7.  A Dynamic Connectome Supports the Emergence of Stable Computational Function of Neural Circuits through Reward-Based Learning.

Authors:  David Kappel; Robert Legenstein; Stefan Habenschuss; Michael Hsieh; Wolfgang Maass
Journal:  eNeuro       Date:  2018-04-24

Review 8.  Neuronal Reward and Decision Signals: From Theories to Data.

Authors:  Wolfram Schultz
Journal:  Physiol Rev       Date:  2015-07       Impact factor: 37.312

9.  Reshaping Movement Distributions With Limit-Push Robotic Training.

Authors:  Amit K Shah; Ian Sharp; Eyad Hajissa; James L Patton
Journal:  IEEE Trans Neural Syst Rehabil Eng       Date:  2018-05-21       Impact factor: 3.802

10.  Reward-based learning for virtual neurorobotics through emotional speech processing.

Authors:  Laurence C Jayet Bray; Gareth B Ferneyhough; Emily R Barker; Corey M Thibeault; Frederick C Harris
Journal:  Front Neurorobot       Date:  2013-04-29       Impact factor: 2.650

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.