Inter-module credit assignment in modular reinforcement learning.

Kazuyuki Samejima, Kenji Doya, Mitsuo Kawato.

Abstract

Critical issues in modular or hierarchical reinforcement learning (RL) are (i) how to decompose a task into sub-tasks, (ii) how to learn the sub-tasks independently of one another, and (iii) how to ensure that the composite policy is optimal for the entire task. The second and third requirements often trade off against each other. We propose a method for propagating the reward for achieving the entire task between modules. The reward is passed in the form of a 'modular reward', which is calculated from the temporal difference of the module gating signal and the value of the succeeding module. We implement modular reward in a multiple model-based reinforcement learning (MMRL) architecture and show its effectiveness in simulations of a pursuit task with hidden states and a continuous-time non-linear control task.
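
The abstract does not give the formula for the modular reward, so the following Python sketch is only one possible reading of "temporal difference of the module gating signal and the value of the succeeding module", not the authors' definition. It assumes discrete time steps, gating signals in [0, 1], and a gating-weighted average as the "succeeding module" value. The names `modular_reward`, `gate_prev`, `gate_next`, and `values_next` are hypothetical.

```python
from typing import Sequence


def modular_reward(
    gate_prev: Sequence[float],
    gate_next: Sequence[float],
    values_next: Sequence[float],
    i: int,
) -> float:
    """Illustrative modular reward for module i (an assumption, not the paper's formula).

    gate_prev / gate_next: gating signal of each module at steps t and t+1.
    values_next: each module's value estimate of the state at step t+1.
    """
    # Temporal difference of the gating signal: how much control module i gives up.
    handoff = max(0.0, gate_prev[i] - gate_next[i])

    # Value of the succeeding module(s): average of the other modules' values,
    # weighted by how strongly each one is gated in at step t+1.
    others = [j for j in range(len(gate_next)) if j != i]
    weight = sum(gate_next[j] for j in others)
    if weight == 0.0:
        return 0.0
    successor_value = sum(gate_next[j] * values_next[j] for j in others) / weight

    # Module i is credited with the successor's value in proportion to the handoff.
    return handoff * successor_value


# Module 0 hands control fully to module 1, whose value estimate is 2.0.
print(modular_reward([1.0, 0.0], [0.0, 1.0], [0.5, 2.0], i=0))
```

Under this reading, a module receives a modular reward only at the moment it hands control to another module, which is how credit for the overall task could flow backward from later modules to earlier ones.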

Year: 2003 PMID: 14692633 DOI: 10.1016/S0893-6080(02)00235-6

Source DB: PubMed Journal: Neural Netw ISSN: 0893-6080

Cited

7 in total

1. Computational models of reinforcement learning: the role of dopamine as a reward signal.

2. Visuomotor coordination and cortical connectivity of modular motor learning.

3. Shifting responsibly: the importance of striatal modularity to reinforcement learning in uncertain environments.

4. Credit assignment in multiple goal embodied visuomotor behavior.

5. From internal models toward metacognitive AI.

6. Temporal-difference reinforcement learning with distributed representations.

7. Modeling sensory-motor decisions in natural behavior.