Wiebke Potjans, Abigail Morrison, Markus Diesmann.
Abstract
The ability to adapt behavior to maximize reward as a result of interactions with the environment is crucial for the survival of any higher organism. In the framework of reinforcement learning, temporal-difference learning algorithms provide an effective strategy for such goal-directed adaptation, but it is unclear to what extent these algorithms are compatible with neural computation. In this article, we present a spiking neural network model that implements actor-critic temporal-difference learning by combining local plasticity rules with a global reward signal. The network is capable of solving a nontrivial gridworld task with sparse rewards. We derive a quantitative mapping of plasticity parameters and synaptic weights to the corresponding variables in the standard algorithmic formulation and demonstrate that the network learns with a similar speed to its discrete time counterpart and attains the same equilibrium performance.Entities:
Year: 2009 PMID: 19196231 DOI: 10.1162/neco.2008.08-07-593
Source DB: PubMed Journal: Neural Comput ISSN: 0899-7667 Impact factor: 2.026
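The discrete-time actor-critic temporal-difference algorithm referenced in the abstract has a compact tabular form. The sketch below is illustrative only: the 5x5 grid, single rewarded goal state, learning rates, and discount factor are assumptions for demonstration and are not taken from the paper. It shows the core idea the abstract describes, namely that a single TD error signal drives both the critic's value update and the actor's policy update:

```python
import random
import math

# Tabular actor-critic TD(0) on a small gridworld (illustrative sketch).
# Agent starts at (0,0); a sparse reward of 1 is given at the goal corner.
N = 5
GOAL = (N - 1, N - 1)
ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]  # right, left, down, up
ALPHA_V = 0.1   # critic learning rate (assumed)
ALPHA_P = 0.1   # actor learning rate (assumed)
GAMMA = 0.9     # discount factor (assumed)

V = {}  # critic: state -> value estimate
P = {}  # actor: (state, action index) -> action preference

def step(state, a):
    """Apply an action; walls keep the agent on the grid."""
    x, y = state
    dx, dy = ACTIONS[a]
    new = (min(max(x + dx, 0), N - 1), min(max(y + dy, 0), N - 1))
    return new, (1.0 if new == GOAL else 0.0)

def policy(state):
    """Sample an action from a softmax over the actor's preferences."""
    prefs = [P.get((state, a), 0.0) for a in range(len(ACTIONS))]
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    r = random.random() * sum(exps)
    acc = 0.0
    for a, e in enumerate(exps):
        acc += e
        if r <= acc:
            return a
    return len(ACTIONS) - 1

def episode(max_steps=200):
    """Run one episode; the TD error updates critic and actor together."""
    state, steps = (0, 0), 0
    while state != GOAL and steps < max_steps:
        a = policy(state)
        new, reward = step(state, a)
        v_new = 0.0 if new == GOAL else V.get(new, 0.0)  # terminal value is 0
        delta = reward + GAMMA * v_new - V.get(state, 0.0)
        V[state] = V.get(state, 0.0) + ALPHA_V * delta
        P[(state, a)] = P.get((state, a), 0.0) + ALPHA_P * delta
        state, steps = new, steps + 1
    return steps

random.seed(0)
early = sum(episode() for _ in range(20)) / 20
for _ in range(300):
    episode()
late = sum(episode() for _ in range(20)) / 20
print(f"mean steps to goal: early {early:.1f}, late {late:.1f}")
```

The mean episode length drops sharply as the preferences sharpen toward the goal, which is the behavior the spiking network is shown to reproduce via local plasticity gated by a global reward signal.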