Literature DB >> 19003453

Parallel reinforcement learning for weighted multi-criteria model with adaptive margin.

Kazuyuki Hiraoka1, Manabu Yoshida, Taketoshi Mishima.   

Abstract

Reinforcement learning (RL) for a linear family of tasks is described in this paper. The key of our discussion is nonlinearity of the optimal solution even if the task family is linear; we cannot obtain the optimal policy using a naive approach. Although an algorithm exists for calculating the equivalent result to Q-learning for each task simultaneously, it presents the problem of explosion of set sizes. We therefore introduce adaptive margins to overcome this difficulty.

Year:  2008        PMID: 19003453      PMCID: PMC2645492          DOI: 10.1007/s11571-008-9066-9

Source DB:  PubMed          Journal:  Cogn Neurodyn        ISSN: 1871-4080            Impact factor:   5.082


  2 in total

1.  Definitions of state variables and state space for brain-computer interface : Part 1. Multiple hierarchical levels of brain function.

Authors:  Walter J Freeman
Journal:  Cogn Neurodyn       Date:  2006-12-07       Impact factor: 5.082

2.  Interaction between the Spatiotemporal Learning Rule (STLR) and Hebb type (HEBB) in single pyramidal cells in the hippocampal CA1 Area.

Authors:  Minoru Tsukada; Yoshiyuki Yamazaki; Hiroshi Kojima
Journal:  Cogn Neurodyn       Date:  2007-02-07       Impact factor: 5.082

  2 in total
  1 in total

1.  Computational models of reinforcement learning: the role of dopamine as a reward signal.

Authors:  R D Samson; M J Frank; Jean-Marc Fellous
Journal:  Cogn Neurodyn       Date:  2010-03-21       Impact factor: 5.082

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.