Literature DB >> 22257174

When, what, and how much to reward in reinforcement learning-based models of cognition.

Christian P Janssen1, Wayne D Gray.   

Abstract

Reinforcement learning approaches to cognitive modeling represent task acquisition as learning to choose the sequence of steps that accomplishes the task while maximizing a reward. However, an apparently unrecognized problem for modelers is choosing when, what, and how much to reward; that is, when (the moment: end of trial, subtask, or some other interval of task performance), what (the objective function: e.g., performance time or performance accuracy), and how much (the magnitude: with binary, categorical, or continuous values). In this article, we explore the problem space of these three parameters in the context of a task whose completion entails some combination of 36 state-action pairs, where all intermediate states (i.e., after the initial state and prior to the end state) represent progressive but partial completion of the task. Different choices produce profoundly different learning paths and outcomes, with the strongest effect for moment. Unfortunately, there is little discussion in the literature of the effect of such choices. This absence is disappointing, as the choice of when, what, and how much needs to be made by a modeler for every learning model.
Copyright © 2012 Cognitive Science Society, Inc.

Entities:  

Mesh:

Year:  2012        PMID: 22257174     DOI: 10.1111/j.1551-6709.2011.01222.x

Source DB:  PubMed          Journal:  Cogn Sci        ISSN: 0364-0213


  3 in total

Review 1.  Navigating complex decision spaces: Problems and paradigms in sequential choice.

Authors:  Matthew M Walsh; John R Anderson
Journal:  Psychol Bull       Date:  2013-07-08       Impact factor: 17.737

2.  Strategic Adaptation to Task Characteristics, Incentives, and Individual Differences in Dual-Tasking.

Authors:  Christian P Janssen; Duncan P Brumby
Journal:  PLoS One       Date:  2015-07-10       Impact factor: 3.240

3.  Dividing Attention Between Tasks: Testing Whether Explicit Payoff Functions Elicit Optimal Dual-Task Performance.

Authors:  George D Farmer; Christian P Janssen; Anh T Nguyen; Duncan P Brumby
Journal:  Cogn Sci       Date:  2017-06-27
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.