Multiple model-based reinforcement learning.

Kenji Doya, Kazuyuki Samejima, Ken-ichi Katagiri, Mitsuo Kawato.

Abstract

We propose a modular reinforcement learning architecture for nonlinear, nonstationary control tasks, which we call multiple model-based reinforcement learning (MMRL). The basic idea is to decompose a complex task into multiple domains in space and time based on the predictability of the environmental dynamics. The system is composed of multiple modules, each of which consists of a state prediction model and a reinforcement learning controller. The "responsibility signal," which is given by the softmax function of the prediction errors, is used to weight the outputs of the multiple modules, as well as to gate the learning of the prediction models and the reinforcement learning controllers. We formulate MMRL for both the discrete-time, finite-state case and the continuous-time, continuous-state case. The performance of MMRL was demonstrated for the discrete case in a nonstationary hunting task in a grid world and for the continuous case in a nonlinear, nonstationary control task of swinging up a pendulum with variable physical parameters.
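
The weighting step described in the abstract can be illustrated with a minimal sketch. The abstract only states that the responsibility signal is a softmax of the prediction errors; the Gaussian-style form below (negative squared error scaled by a width `sigma`), the function names `responsibility` and `blend`, and the example numbers are all assumptions made for illustration and are not taken from the paper.

```python
import math


def responsibility(errors, sigma=1.0):
    """Softmax over negative squared prediction errors.

    ASSUMPTION: the exact error scaling is not given in the abstract;
    a Gaussian-style exponent -e^2 / (2 * sigma^2) is used here.
    """
    scores = [-(e ** 2) / (2.0 * sigma ** 2) for e in errors]
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [v / total for v in exps]


def blend(outputs, weights):
    """Weight each module's (scalar) output by its responsibility."""
    return sum(w * u for w, u in zip(weights, outputs))


# Hypothetical example: three modules with different prediction errors.
errors = [0.1, 1.0, 2.0]
lam = responsibility(errors, sigma=0.5)

# Hypothetical controller outputs, one per module.
action = blend([1.0, -1.0, 0.0], lam)
```

In this sketch the module with the smallest prediction error receives the largest weight, so its controller dominates the blended output. The same weights could also scale each module's learning updates, which is one way to read the gating role mentioned in the abstract.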

Year: 2002 PMID: 12020450 DOI: 10.1162/089976602753712972

Source DB: PubMed Journal: Neural Comput ISSN: 0899-7667 Impact factor: 2.026

Cited

80 in total

1. A unifying computational framework for motor control and social interaction.

Authors: Daniel M Wolpert; Kenji Doya; Mitsuo Kawato
Journal: Philos Trans R Soc Lond B Biol Sci Date: 2003-03-29 Impact factor: 6.237

2. Functional magnetic resonance imaging examination of two modular architectures for switching multiple internal models.

Authors: Hiroshi Imamizu; Tomoe Kuroda; Toshinori Yoshioka; Mitsuo Kawato
Journal: J Neurosci Date: 2004-02-04 Impact factor: 6.167

3. Abstract rule learning: the differential effects of lesions in frontal cortex.

4. A pallidus-habenula-dopamine pathway signals inferred stimulus values.

5. Protection and expression of human motor memories.

6. A model of prefrontal cortical mechanisms for goal-directed behavior.

7. Neural correlates of the divergence of instrumental probability distributions.

8. Mechanisms of hierarchical reinforcement learning in corticostriatal circuits 1: computational analysis.

9. Computational models of reinforcement learning: the role of dopamine as a reward signal.

10. Temporal-difference reinforcement learning with distributed representations.