Alterations in choice behavior by manipulations of world model.

C S Green, C Benson, D Kersten, P Schrater.

Abstract

How to compute initially unknown reward values makes up one of the key problems in reinforcement learning theory, with two basic approaches being used. Model-free algorithms rely on the accumulation of substantial amounts of experience to compute the value of actions, whereas in model-based learning, the agent seeks to learn the generative process for outcomes from which the value of actions can be predicted. Here we show that (i) "probability matching," a consistent example of suboptimal choice behavior seen in humans, occurs in an optimal Bayesian model-based learner using a max decision rule that is initialized with ecologically plausible, but incorrect beliefs about the generative process for outcomes and (ii) human behavior can be strongly and predictably altered by the presence of cues suggestive of various generative processes, despite statistically identical outcome generation. These results suggest human decision making is rational and model based and not consistent with model-free learning.
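The sense in which probability matching is suboptimal can be illustrated with a short calculation. This is a minimal sketch and not the model used in the paper: it assumes a two-option task with independent trials and a fixed probability `p` (a hypothetical parameter) that option A is the rewarded one. The function names are illustrative only.

```python
def matching_accuracy(p: float) -> float:
    """Expected accuracy when option A is chosen with probability p.

    Assumption: trials are independent and option A is correct with
    fixed probability p. A matcher picks A on a fraction p of trials,
    so it is right with probability p*p + (1-p)*(1-p).
    """
    return p * p + (1.0 - p) * (1.0 - p)


def maximizing_accuracy(p: float) -> float:
    """Expected accuracy when the more likely option is always chosen."""
    return max(p, 1.0 - p)


if __name__ == "__main__":
    # Compare the two strategies across a few reward probabilities.
    for p in (0.5, 0.6, 0.7, 0.8, 0.9):
        print(
            f"p={p:.1f}  "
            f"matching={matching_accuracy(p):.2f}  "
            f"maximizing={maximizing_accuracy(p):.2f}"
        )
```

Under these assumptions, with `p = 0.7` a matcher is correct on about 58% of trials, while always choosing the more likely option is correct on 70%. The gap exists only when outcomes really are independent and stationary, which is the kind of belief about the generative process that the abstract says can be incorrect.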

Year:  2010        PMID: 20805507      PMCID: PMC2941269          DOI: 10.1073/pnas.1001709107

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


