Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Reinforcement learning with modulated spike timing dependent synaptic plasticity.

Literature DB >> 17928565

Reinforcement learning with modulated spike timing dependent synaptic plasticity.

Michael A Farries¹, Adrienne L Fairhall.

Abstract

Spike timing-dependent synaptic plasticity (STDP) has emerged as the preferred framework linking patterns of pre- and postsynaptic activity to changes in synaptic strength. Although synaptic plasticity is widely believed to be a major component of learning, it is unclear how STDP itself could serve as a mechanism for general purpose learning. On the other hand, algorithms for reinforcement learning work on a wide variety of problems, but lack an experimentally established neural implementation. Here, we combine these paradigms in a novel model in which a modified version of STDP achieves reinforcement learning. We build this model in stages, identifying a minimal set of conditions needed to make it work. Using a performance-modulated modification of STDP in a two-layer feedforward network, we can train output neurons to generate arbitrarily selected spike trains or population responses. Furthermore, a given network can learn distinct responses to several different input patterns. We also describe in detail how this model might be implemented biologically. Thus our model offers a novel and biologically plausible implementation of reinforcement learning that is capable of training a neural population to produce a very wide range of possible mappings between synaptic input and spiking output.

Entities: Chemical

Mesh：

Year: 2007 PMID： 17928565 DOI： 10.1152/jn.00364.2007

Source DB: PubMed Journal: J Neurophysiol ISSN： 0022-3077 Impact factor: 2.714

Keyword Cloud
Cited

43 in total

Reinforcement learning with modulated spike timing dependent synaptic plasticity.

1. A biophysically-based neuromorphic model of spike rate- and timing-dependent plasticity.

2. Premotor synaptic plasticity limited to the critical period for song learning.

Review 3. A hypothesis for basal ganglia-dependent reinforcement learning in the songbird.

Review 4. The role of efference copy in striatal learning.

5. Variation in sequence dynamics improves maintenance of stereotyped behavior in an example from bird song.

6. Dopaminergic modulation of basal ganglia output through coupled excitation-inhibition.

7. Reinforcement learning of two-joint virtual arm reaching in a computer model of sensorimotor cortex.

Review 8. Dopaminergic system in birdsong learning and maintenance.

9. Spike-based reinforcement learning in continuous state and action space: when policy gradient methods fail.

10. Synaptic theory of replicator-like melioration.