| Literature DB >> 19003453 |
Kazuyuki Hiraoka1, Manabu Yoshida, Taketoshi Mishima.
Abstract
Reinforcement learning (RL) for a linear family of tasks is described in this paper. The key of our discussion is nonlinearity of the optimal solution even if the task family is linear; we cannot obtain the optimal policy using a naive approach. Although an algorithm exists for calculating the equivalent result to Q-learning for each task simultaneously, it presents the problem of explosion of set sizes. We therefore introduce adaptive margins to overcome this difficulty.Year: 2008 PMID: 19003453 PMCID: PMC2645492 DOI: 10.1007/s11571-008-9066-9
Source DB: PubMed Journal: Cogn Neurodyn ISSN: 1871-4080 Impact factor: 5.082