Literature DB >> 29971644

Optimal response vigor and choice under non-stationary outcome values.

Amir Dezfouli1, Bernard W Balleine2, Richard Nock3,4,5.   

Abstract

Within a rational framework, a decision-maker selects actions based on the reward-maximization principle, which stipulates that they acquire outcomes with the highest value at the lowest cost. Action selection can be divided into two dimensions: selecting an action from various alternatives, and choosing its vigor, i.e., how fast the selected action should be executed. Both of these dimensions depend on the values of outcomes, which are often affected as more outcomes are consumed together with their associated actions. Despite this, previous research has only addressed the computational substrate of optimal actions in the specific condition that the values of outcomes are constant. It is not known what actions are optimal when the values of outcomes are non-stationary. Here, based on an optimal control framework, we derive a computational model for optimal actions when outcome values are non-stationary. The results imply that, even when the values of outcomes are changing, the optimal response rate is constant rather than decreasing. This finding shows that, in contrast to previous theories, commonly observed changes in action rate cannot be attributed solely to changes in outcome value. We then prove that this observation can be explained based on uncertainty about temporal horizons; e.g., the session duration. We further show that, when multiple outcomes are available, the model explains probability matching as well as maximization strategies. The model therefore provides a quantitative analysis of optimal action and explicit predictions for future testing.

Keywords:  Choice; Optimal actions; Response vigor; Reward learning

Mesh:

Year:  2019        PMID: 29971644     DOI: 10.3758/s13423-018-1500-3

Source DB:  PubMed          Journal:  Psychon Bull Rev        ISSN: 1069-9384


  41 in total

1.  Taking the easy way out? Increasing implementation effort reduces probability maximizing under cognitive load.

Authors:  Christin Schulze; Ben R Newell
Journal:  Mem Cognit       Date:  2016-07

2.  Fixed-ratio level pressing by VMH rats: work vs accessibility of sucrose reward.

Authors:  J E Kelsey; J Allison
Journal:  Physiol Behav       Date:  1976-11

Review 3.  A normative perspective on motivation.

Authors:  Yael Niv; Daphna Joel; Peter Dayan
Journal:  Trends Cogn Sci       Date:  2006-07-13       Impact factor: 20.229

4.  Patterns of responding within sessions.

Authors:  F K McSweeney; J M Hinson
Journal:  J Exp Anal Behav       Date:  1992-07       Impact factor: 2.468

5.  The effects of motivation on response rate: a hidden semi-Markov model analysis of behavioral dynamics.

Authors:  Eran Eldar; Genela Morris; Yael Niv
Journal:  J Neurosci Methods       Date:  2011-07-18       Impact factor: 2.390

6.  Formation and control of optimal trajectory in human multijoint arm movement. Minimum torque-change model.

Authors:  Y Uno; M Kawato; R Suzuki
Journal:  Biol Cybern       Date:  1989       Impact factor: 2.086

7.  Food motivated behavior in genetically obese and hypothalamic-hyperphagic rats and mice.

Authors:  M R Greenwood; D Quartermain; P R Johnson; J A Cruce; J Hirsch
Journal:  Physiol Behav       Date:  1974-11

8.  Of matchers and maximizers: How competition shapes choice under risk and uncertainty.

Authors:  Christin Schulze; Don van Ravenzwaaij; Ben R Newell
Journal:  Cogn Psychol       Date:  2015-04-08       Impact factor: 3.468

9.  The smart potential behind probability matching.

Authors:  Wolfgang Gaissmaier; Lael J Schooler
Journal:  Cognition       Date:  2008-11-18

10.  An examination of the generalizability of motor costs.

Authors:  Max Berniker; Megan K O'Brien; Konrad P Kording; Alaa A Ahmed
Journal:  PLoS One       Date:  2013-01-14       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.