Literature DB >> 17444757

Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity.

Răzvan V Florian1.   

Abstract

The persistent modification of synaptic efficacy as a function of the relative timing of pre- and postsynaptic spikes is a phenomenon known as spike-timing-dependent plasticity (STDP). Here we show that the modulation of STDP by a global reward signal leads to reinforcement learning. We first derive analytically learning rules involving reward-modulated spike-timing-dependent synaptic and intrinsic plasticity, by applying a reinforcement learning algorithm to the stochastic spike response model of spiking neurons. These rules have several features common to plasticity mechanisms experimentally found in the brain. We then demonstrate in simulations of networks of integrate-and-fire neurons the efficacy of two simple learning rules involving modulated STDP. One rule is a direct extension of the standard STDP model (modulated STDP), and the other one involves an eligibility trace stored at each synapse that keeps a decaying memory of the relationships between the recent pairs of pre- and postsynaptic spike pairs (modulated STDP with eligibility trace). This latter rule permits learning even if the reward signal is delayed. The proposed rules are able to solve the XOR problem with both rate coded and temporally coded input and to learn a target output firing-rate pattern. These learning rules are biologically plausible, may be used for training generic artificial spiking neural networks, regardless of the neural model used, and suggest the experimental investigation in animals of the existence of reward-modulated STDP.

Mesh:

Year:  2007        PMID: 17444757     DOI: 10.1162/neco.2007.19.6.1468

Source DB:  PubMed          Journal:  Neural Comput        ISSN: 0899-7667            Impact factor:   2.026


  67 in total

1.  A biophysically-based neuromorphic model of spike rate- and timing-dependent plasticity.

Authors:  Guy Rachmuth; Harel Z Shouval; Mark F Bear; Chi-Sang Poon
Journal:  Proc Natl Acad Sci U S A       Date:  2011-11-16       Impact factor: 11.205

2.  Reinforcement learning in populations of spiking neurons.

Authors:  Robert Urbanczik; Walter Senn
Journal:  Nat Neurosci       Date:  2009-02-15       Impact factor: 24.884

3.  Alternative time representation in dopamine models.

Authors:  François Rivest; John F Kalaska; Yoshua Bengio
Journal:  J Comput Neurosci       Date:  2009-10-22       Impact factor: 1.621

4.  Early detection of hand movements from electroencephalograms for stroke therapy applications.

Authors:  A Muralidharan; J Chae; D M Taylor
Journal:  J Neural Eng       Date:  2011-05-27       Impact factor: 5.379

5.  Computational models of reinforcement learning: the role of dopamine as a reward signal.

Authors:  R D Samson; M J Frank; Jean-Marc Fellous
Journal:  Cogn Neurodyn       Date:  2010-03-21       Impact factor: 5.082

Review 6.  The role of efference copy in striatal learning.

Authors:  Michale S Fee
Journal:  Curr Opin Neurobiol       Date:  2014-02-21       Impact factor: 6.627

7.  Reinforcement learning of two-joint virtual arm reaching in a computer model of sensorimotor cortex.

Authors:  Samuel A Neymotin; George L Chadderdon; Cliff C Kerr; Joseph T Francis; William W Lytton
Journal:  Neural Comput       Date:  2013-09-18       Impact factor: 2.026

8.  Spike-based reinforcement learning in continuous state and action space: when policy gradient methods fail.

Authors:  Eleni Vasilaki; Nicolas Frémaux; Robert Urbanczik; Walter Senn; Wulfram Gerstner
Journal:  PLoS Comput Biol       Date:  2009-12-04       Impact factor: 4.475

9.  Synaptic theory of replicator-like melioration.

Authors:  Yonatan Loewenstein
Journal:  Front Comput Neurosci       Date:  2010-06-17       Impact factor: 2.380

10.  Enabling functional neural circuit simulations with distributed computing of neuromodulated plasticity.

Authors:  Wiebke Potjans; Abigail Morrison; Markus Diesmann
Journal:  Front Comput Neurosci       Date:  2010-11-23       Impact factor: 2.380

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.