Literature DB >> 17928565

Reinforcement learning with modulated spike timing dependent synaptic plasticity.

Michael A Farries1, Adrienne L Fairhall.   

Abstract

Spike timing-dependent synaptic plasticity (STDP) has emerged as the preferred framework linking patterns of pre- and postsynaptic activity to changes in synaptic strength. Although synaptic plasticity is widely believed to be a major component of learning, it is unclear how STDP itself could serve as a mechanism for general purpose learning. On the other hand, algorithms for reinforcement learning work on a wide variety of problems, but lack an experimentally established neural implementation. Here, we combine these paradigms in a novel model in which a modified version of STDP achieves reinforcement learning. We build this model in stages, identifying a minimal set of conditions needed to make it work. Using a performance-modulated modification of STDP in a two-layer feedforward network, we can train output neurons to generate arbitrarily selected spike trains or population responses. Furthermore, a given network can learn distinct responses to several different input patterns. We also describe in detail how this model might be implemented biologically. Thus our model offers a novel and biologically plausible implementation of reinforcement learning that is capable of training a neural population to produce a very wide range of possible mappings between synaptic input and spiking output.

Entities:  

Mesh:

Year:  2007        PMID: 17928565     DOI: 10.1152/jn.00364.2007

Source DB:  PubMed          Journal:  J Neurophysiol        ISSN: 0022-3077            Impact factor:   2.714


  43 in total

1.  A biophysically-based neuromorphic model of spike rate- and timing-dependent plasticity.

Authors:  Guy Rachmuth; Harel Z Shouval; Mark F Bear; Chi-Sang Poon
Journal:  Proc Natl Acad Sci U S A       Date:  2011-11-16       Impact factor: 11.205

2.  Premotor synaptic plasticity limited to the critical period for song learning.

Authors:  Max Sizemore; David J Perkel
Journal:  Proc Natl Acad Sci U S A       Date:  2011-10-03       Impact factor: 11.205

Review 3.  A hypothesis for basal ganglia-dependent reinforcement learning in the songbird.

Authors:  M S Fee; J H Goldberg
Journal:  Neuroscience       Date:  2011-10-13       Impact factor: 3.590

Review 4.  The role of efference copy in striatal learning.

Authors:  Michale S Fee
Journal:  Curr Opin Neurobiol       Date:  2014-02-21       Impact factor: 6.627

5.  Variation in sequence dynamics improves maintenance of stereotyped behavior in an example from bird song.

Authors:  Alison Duffy; Elliott Abe; David J Perkel; Adrienne L Fairhall
Journal:  Proc Natl Acad Sci U S A       Date:  2019-04-23       Impact factor: 11.205

6.  Dopaminergic modulation of basal ganglia output through coupled excitation-inhibition.

Authors:  Agata Budzillo; Alison Duffy; Kimberly E Miller; Adrienne L Fairhall; David J Perkel
Journal:  Proc Natl Acad Sci U S A       Date:  2017-05-15       Impact factor: 11.205

7.  Reinforcement learning of two-joint virtual arm reaching in a computer model of sensorimotor cortex.

Authors:  Samuel A Neymotin; George L Chadderdon; Cliff C Kerr; Joseph T Francis; William W Lytton
Journal:  Neural Comput       Date:  2013-09-18       Impact factor: 2.026

Review 8.  Dopaminergic system in birdsong learning and maintenance.

Authors:  Lubica Kubikova; Lubor Kostál
Journal:  J Chem Neuroanat       Date:  2009-11-10       Impact factor: 3.052

9.  Spike-based reinforcement learning in continuous state and action space: when policy gradient methods fail.

Authors:  Eleni Vasilaki; Nicolas Frémaux; Robert Urbanczik; Walter Senn; Wulfram Gerstner
Journal:  PLoS Comput Biol       Date:  2009-12-04       Impact factor: 4.475

10.  Synaptic theory of replicator-like melioration.

Authors:  Yonatan Loewenstein
Journal:  Front Comput Neurosci       Date:  2010-06-17       Impact factor: 2.380

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.