Literature DB >> 25214675

A Comparison Model of Reinforcement-Learning and Win-Stay-Lose-Shift Decision-Making Processes: A Tribute to W.K. Estes.

Darrell A Worthy1, W Todd Maddox2.   

Abstract

W.K. Estes often championed an approach to model development whereby an existing model was augmented by the addition of one or more free parameters, and a comparison between the simple and more complex, augmented model determined whether the additions were justified. Following this same approach we utilized Estes' (1950) own augmented learning equations to improve the fit and plausibility of a win-stay-lose-shift (WSLS) model that we have used in much of our recent work. Estes also championed models that assumed a comparison between multiple concurrent cognitive processes. In line with this, we develop a WSLS-Reinforcement Learning (RL) model that assumes that the output of a WSLS process that provides a probability of staying or switching to a different option based on the last two decision outcomes is compared with the output of an RL process that determines a probability of selecting each option based on a comparison of the expected value of each option. Fits to data from three different decision-making experiments suggest that the augmentations to the WSLS and RL models lead to a better account of decision-making behavior. Our results also support the assertion that human participants weigh the output of WSLS and RL processes during decision-making.

Entities:  

Keywords:  Decision-making; dual-process; mathematical modeling; reinforcement learning; win-stay-lose-shift

Year:  2014        PMID: 25214675      PMCID: PMC4159167          DOI: 10.1016/j.jmp.2013.10.001

Source DB:  PubMed          Journal:  J Math Psychol        ISSN: 0022-2496            Impact factor:   2.223


  23 in total

1.  Traps in the route to models of memory and decision.

Authors:  W K Estes
Journal:  Psychon Bull Rev       Date:  2002-03

2.  Effect of prior patterns of experience upon strategies and learning sets.

Authors:  J J GOODNOW; T F PETTIGREW
Journal:  J Exp Psychol       Date:  1955-06

3.  AIC model selection using Akaike weights.

Authors:  Eric-Jan Wagenmakers; Simon Farrell
Journal:  Psychon Bull Rev       Date:  2004-02

4.  Comparison of basic assumptions embedded in learning models for experience-based decision making.

Authors:  Eldad Yechiam; Jerome R Busemeyer
Journal:  Psychon Bull Rev       Date:  2005-06

5.  Regulatory fit effects in a choice task.

Authors:  Darrell A Worthy; W Todd Maddox; Arthur B Markman
Journal:  Psychon Bull Rev       Date:  2007-12

6.  Processes of memory loss, recovery, and distortion.

Authors:  W K Estes
Journal:  Psychol Rev       Date:  1997-01       Impact factor: 8.934

7.  Regulatory fit and systematic exploration in a dynamic decision-making environment.

Authors:  A Ross Otto; Arthur B Markman; Todd M Gureckis; Bradley C Love
Journal:  J Exp Psychol Learn Mem Cogn       Date:  2010-05       Impact factor: 3.051

8.  A strategy of win-stay, lose-shift that outperforms tit-for-tat in the Prisoner's Dilemma game.

Authors:  M Nowak; K Sigmund
Journal:  Nature       Date:  1993-07-01       Impact factor: 49.962

9.  Short-term memory traces for action bias in human reinforcement learning.

Authors:  Rafal Bogacz; Samuel M McClure; Jian Li; Jonathan D Cohen; P Read Montague
Journal:  Brain Res       Date:  2007-03-24       Impact factor: 3.252

10.  Learning in Noise: Dynamic Decision-Making in a Variable Environment.

Authors:  Todd M Gureckis; Bradley C Love
Journal:  J Math Psychol       Date:  2009-06       Impact factor: 2.223

View more
  23 in total

1.  Altered behavioral and neural responsiveness to counterfactual gains in the elderly.

Authors:  Michael J Tobia; Rong Guo; Jan Gläscher; Ulrike Schwarze; Stefanie Brassen; Christian Büchel; Klaus Obermayer; Tobias Sommer
Journal:  Cogn Affect Behav Neurosci       Date:  2016-06       Impact factor: 3.282

2.  Reinforcement learning models of risky choice and the promotion of risk-taking by losses disguised as wins in rats.

Authors:  Andrew T Marshall; Kimberly Kirkpatrick
Journal:  J Exp Psychol Anim Learn Cogn       Date:  2017-07       Impact factor: 2.478

3.  Learning reward frequency over reward probability: A tale of two learning rules.

Authors:  Hilary J Don; A Ross Otto; Astin C Cornwall; Tyler Davis; Darrell A Worthy
Journal:  Cognition       Date:  2019-08-17

4.  Deciphering Age Differences in Experience-Based Decision-Making: The Role of Sleep.

Authors:  Xue-Rui Peng; Yun-Rui Liu; Dong-Qiong Fan; Xu Lei; Quan-Ying Liu; Jing Yu
Journal:  Nat Sci Sleep       Date:  2020-09-29

5.  The effect of obstructed action efficacy on reward-based decision-making in healthy adolescents: a novel functional MRI task to assay frustration.

Authors:  Katia M Harlé; Tiffany C Ho; Colm G Connolly; Alan N Simmons; Tony T Yang
Journal:  Cogn Affect Behav Neurosci       Date:  2021-12-29       Impact factor: 3.526

6.  Age-related impairments on the touchscreen paired associates learning (PAL) task in male rats.

Authors:  Samantha M Smith; Sabrina Zequeira; Meena Ravi; Sarah A Johnson; Andriena M Hampton; Aleyna M Ross; Wonn Pyon; Andrew P Maurer; Jennifer L Bizon; Sara N Burke
Journal:  Neurobiol Aging       Date:  2021-10-02       Impact factor: 5.133

7.  Development of a novel computational model for the Balloon Analogue Risk Task: The Exponential-Weight Mean-Variance Model.

Authors:  Harhim Park; Jaeyeong Yang; Jasmin Vassileva; Woo-Young Ahn
Journal:  J Math Psychol       Date:  2021-04-21       Impact factor: 1.387

8.  A Case of Divergent Predictions Made by Delta and Decay Rule Learning Models.

Authors:  Darrell A Worthy; A Ross Otto; Astin C Cornwall; Hilary J Don; Tyler Davis
Journal:  Cogsci       Date:  2018-07

9.  Role of dopamine D2 receptors in optimizing choice strategy in a dynamic and uncertain environment.

Authors:  Shinae Kwak; Namjung Huh; Ji-Seon Seo; Jung-Eun Lee; Pyung-Lim Han; Min W Jung
Journal:  Front Behav Neurosci       Date:  2014-10-28       Impact factor: 3.558

10.  Neuronal activity in dorsomedial and dorsolateral striatum under the requirement for temporal credit assignment.

Authors:  Eun Sil Her; Namjung Huh; Jieun Kim; Min Whan Jung
Journal:  Sci Rep       Date:  2016-06-01       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.