Literature DB >> 25582684

Do learning rates adapt to the distribution of rewards?

Samuel J Gershman1.   

Abstract

Studies of reinforcement learning have shown that humans learn differently in response to positive and negative reward prediction errors, a phenomenon that can be captured computationally by positing asymmetric learning rates. This asymmetry, motivated by neurobiological and cognitive considerations, has been invoked to explain learning differences across the lifespan as well as a range of psychiatric disorders. Recent theoretical work, motivated by normative considerations, has hypothesized that the learning rate asymmetry should be modulated by the distribution of rewards across the available options. In particular, the learning rate for negative prediction errors should be higher than the learning rate for positive prediction errors when the average reward rate is high, and this relationship should reverse when the reward rate is low. We tested this hypothesis in a series of experiments. Contrary to the theoretical predictions, we found that the asymmetry was largely insensitive to the average reward rate; instead, the dominant pattern was a higher learning rate for negative than for positive prediction errors, possibly reflecting risk aversion.

Entities:  

Keywords:  Decision-making; Multi-armed bandit; Reinforcement learning

Mesh:

Year:  2015        PMID: 25582684     DOI: 10.3758/s13423-014-0790-3

Source DB:  PubMed          Journal:  Psychon Bull Rev        ISSN: 1069-9384


  18 in total

Review 1.  Learning and selective attention.

Authors:  P Dayan; S Kakade; P R Montague
Journal:  Nat Neurosci       Date:  2000-11       Impact factor: 24.884

Review 2.  Opponent interactions between serotonin and dopamine.

Authors:  Nathaniel D Daw; Sham Kakade; Peter Dayan
Journal:  Neural Netw       Date:  2002 Jun-Jul

Review 3.  Metalearning and neuromodulation.

Authors:  Kenji Doya
Journal:  Neural Netw       Date:  2002 Jun-Jul

4.  Neural and psychological maturation of decision-making in adolescence and young adulthood.

Authors:  Anastasia Christakou; Samuel J Gershman; Yael Niv; Andrew Simmons; Mick Brammer; Katya Rubia
Journal:  J Cogn Neurosci       Date:  2013-07-16       Impact factor: 3.225

5.  Adaptive properties of differential learning rates for positive and negative outcomes.

Authors:  Romain D Cazé; Matthijs A A van der Meer
Journal:  Biol Cybern       Date:  2013-10-02       Impact factor: 2.086

6.  Dopaminergic drugs modulate learning rates and perseveration in Parkinson's patients in a dynamic foraging task.

Authors:  Robb B Rutledge; Stephanie C Lazzaro; Brian Lau; Catherine E Myers; Mark A Gluck; Paul W Glimcher
Journal:  J Neurosci       Date:  2009-12-02       Impact factor: 6.167

7.  Genetic triple dissociation reveals multiple roles for dopamine in reinforcement learning.

Authors:  Michael J Frank; Ahmed A Moustafa; Heather M Haughey; Tim Curran; Kent E Hutchison
Journal:  Proc Natl Acad Sci U S A       Date:  2007-10-03       Impact factor: 11.205

8.  Bayesian model selection for group studies.

Authors:  Klaas Enno Stephan; Will D Penny; Jean Daunizeau; Rosalyn J Moran; Karl J Friston
Journal:  Neuroimage       Date:  2009-03-20       Impact factor: 6.556

9.  Learning the value of information in an uncertain world.

Authors:  Timothy E J Behrens; Mark W Woolrich; Mark E Walton; Matthew F S Rushworth
Journal:  Nat Neurosci       Date:  2007-08-05       Impact factor: 24.884

10.  Evaluating Amazon's Mechanical Turk as a tool for experimental behavioral research.

Authors:  Matthew J C Crump; John V McDonnell; Todd M Gureckis
Journal:  PLoS One       Date:  2013-03-13       Impact factor: 3.240

View more
  29 in total

1.  An Obesity-Predisposing Variant of the FTO Gene Regulates D2R-Dependent Reward Learning.

Authors:  Meltem Sevgi; Lionel Rigoux; Anne B Kühn; Jan Mauer; Leonhard Schilbach; Martin E Hess; Theo O J Gruendler; Markus Ullsperger; Klaas Enno Stephan; Jens C Brüning; Marc Tittgemeyer
Journal:  J Neurosci       Date:  2015-09-09       Impact factor: 6.167

2.  Causal Inference About Good and Bad Outcomes.

Authors:  Hayley M Dorfman; Rahul Bhui; Brent L Hughes; Samuel J Gershman
Journal:  Psychol Sci       Date:  2019-02-13

3.  Credit assignment in movement-dependent reinforcement learning.

Authors:  Samuel D McDougle; Matthew J Boggess; Matthew J Crossley; Darius Parvin; Richard B Ivry; Jordan A Taylor
Journal:  Proc Natl Acad Sci U S A       Date:  2016-05-31       Impact factor: 11.205

Review 4.  The relative merit of empirical priors in non-identifiable and sloppy models: Applications to models of learning and decision-making : Empirical priors.

Authors:  Mikhail S Spektor; David Kellen
Journal:  Psychon Bull Rev       Date:  2018-12

5.  How learning shapes the empathic brain.

Authors:  Grit Hein; Jan B Engelmann; Marius C Vollberg; Philippe N Tobler
Journal:  Proc Natl Acad Sci U S A       Date:  2015-12-22       Impact factor: 11.205

6.  Neural Signatures of Prediction Errors in a Decision-Making Task Are Modulated by Action Execution Failures.

Authors:  Samuel D McDougle; Peter A Butcher; Darius E Parvin; Fasial Mushtaq; Yael Niv; Richard B Ivry; Jordan A Taylor
Journal:  Curr Biol       Date:  2019-05-02       Impact factor: 10.834

7.  The drift diffusion model as the choice rule in reinforcement learning.

Authors:  Mads Lund Pedersen; Michael J Frank; Guido Biele
Journal:  Psychon Bull Rev       Date:  2017-08

8.  The Outcome-Representation Learning Model: A Novel Reinforcement Learning Model of the Iowa Gambling Task.

Authors:  Nathaniel Haines; Jasmin Vassileva; Woo-Young Ahn
Journal:  Cogn Sci       Date:  2018-10-05

9.  When Implicit Prosociality Trumps Selfishness: The Neural Valuation System Underpins More Optimal Choices When Learning to Avoid Harm to Others Than to Oneself.

Authors:  Lukas L Lengersdorff; Isabella C Wagner; Patricia L Lockwood; Claus Lamm
Journal:  J Neurosci       Date:  2020-08-24       Impact factor: 6.167

10.  Pain relief provided by an outgroup member enhances analgesia.

Authors:  Grit Hein; Jan B Engelmann; Philippe N Tobler
Journal:  Proc Biol Sci       Date:  2018-09-26       Impact factor: 5.349

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.