Literature DB >> 23069349

Learning to represent reward structure: a key to adapting to complex environments.

Hiroyuki Nakahara1, Okihide Hikosaka.   

Abstract

Predicting outcomes is a critical ability of humans and animals. The dopamine reward prediction error hypothesis, the driving force behind the recent progress in neural "value-based" decision making, states that dopamine activity encodes the signals for learning in order to predict a reward, that is, the difference between the actual and predicted reward, called the reward prediction error. However, this hypothesis and its underlying assumptions limit the prediction and its error as reactively triggered by momentary environmental events. Reviewing the assumptions and some of the latest findings, we suggest that the internal state representation is learned to reflect the environmental reward structure, and we propose a new hypothesis - the dopamine reward structural learning hypothesis - in which dopamine activity encodes multiplex signals for learning in order to represent reward structure in the internal state, leading to better reward prediction.
Copyright © 2012 Elsevier Ireland Ltd and the Japan Neuroscience Society. All rights reserved.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 23069349      PMCID: PMC3513573          DOI: 10.1016/j.neures.2012.09.007

Source DB:  PubMed          Journal:  Neurosci Res        ISSN: 0168-0102            Impact factor:   3.304


  67 in total

Review 1.  A common framework for perceptual learning.

Authors:  Aaron R Seitz; Hubert R Dinse
Journal:  Curr Opin Neurobiol       Date:  2007-02-20       Impact factor: 6.627

2.  Stimulus representation and the timing of reward-prediction errors in models of the dopamine system.

Authors:  Elliot A Ludvig; Richard S Sutton; E James Kehoe
Journal:  Neural Comput       Date:  2008-12       Impact factor: 2.026

Review 3.  A framework for studying the neurobiology of value-based decision making.

Authors:  Antonio Rangel; Colin Camerer; P Read Montague
Journal:  Nat Rev Neurosci       Date:  2008-06-11       Impact factor: 34.870

Review 4.  A neural substrate of prediction and reward.

Authors:  W Schultz; P Dayan; P R Montague
Journal:  Science       Date:  1997-03-14       Impact factor: 47.728

5.  Neurons in anterior cingulate cortex multiplex information about reward and action.

Authors:  Benjamin Y Hayden; Michael L Platt
Journal:  J Neurosci       Date:  2010-03-03       Impact factor: 6.167

Review 6.  Model-based learning and the contribution of the orbitofrontal cortex to the model-free world.

Authors:  Michael A McDannald; Yuji K Takahashi; Nina Lopatina; Brad W Pietras; Josh L Jones; Geoffrey Schoenbaum
Journal:  Eur J Neurosci       Date:  2012-04       Impact factor: 3.386

7.  Structure learning in human sequential decision-making.

Authors:  Daniel E Acuña; Paul Schrater
Journal:  PLoS Comput Biol       Date:  2010-12-02       Impact factor: 4.475

8.  Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards.

Authors:  Matthew R Roesch; Donna J Calu; Geoffrey Schoenbaum
Journal:  Nat Neurosci       Date:  2007-11-18       Impact factor: 24.884

9.  Neural mechanisms of foraging.

Authors:  Nils Kolling; Timothy E J Behrens; Rogier B Mars; Matthew F S Rushworth
Journal:  Science       Date:  2012-04-06       Impact factor: 47.728

10.  Uncertainty in action-value estimation affects both action choice and learning rate of the choice behaviors of rats.

Authors:  Akihiro Funamizu; Makoto Ito; Kenji Doya; Ryohei Kanzaki; Hirokazu Takahashi
Journal:  Eur J Neurosci       Date:  2012-04       Impact factor: 3.386

View more
  9 in total

1.  Rethinking dopamine as generalized prediction error.

Authors:  Matthew P H Gardner; Geoffrey Schoenbaum; Samuel J Gershman
Journal:  Proc Biol Sci       Date:  2018-11-21       Impact factor: 5.349

2.  Domain-Specific Working Memory, But Not Dopamine-Related Genetic Variability, Shapes Reward-Based Motor Learning.

Authors:  Peter Holland; Olivier Codol; Elizabeth Oxley; Madison Taylor; Elizabeth Hamshere; Shadiq Joseph; Laura Huffer; Joseph M Galea
Journal:  J Neurosci       Date:  2019-10-11       Impact factor: 6.167

Review 3.  Model-based predictions for dopamine.

Authors:  Angela J Langdon; Melissa J Sharpe; Geoffrey Schoenbaum; Yael Niv
Journal:  Curr Opin Neurobiol       Date:  2017-10-31       Impact factor: 6.627

4.  Reinforcement learning with associative or discriminative generalization across states and actions: fMRI at 3 T and 7 T.

Authors:  Jaron T Colas; Neil M Dundon; Raphael T Gerraty; Natalie M Saragosa-Harris; Karol P Szymula; Koranis Tanwisuth; J Michael Tyszka; Camilla van Geen; Harang Ju; Arthur W Toga; Joshua I Gold; Dani S Bassett; Catherine A Hartley; Daphna Shohamy; Scott T Grafton; John P O'Doherty
Journal:  Hum Brain Mapp       Date:  2022-07-21       Impact factor: 5.399

5.  Context-Dependent Multiplexing by Individual VTA Dopamine Neurons.

Authors:  Yves Kremer; Jérôme Flakowski; Clément Rohner; Christian Lüscher
Journal:  J Neurosci       Date:  2020-08-28       Impact factor: 6.167

6.  Learning where to look for a hidden target.

Authors:  Leanne Chukoskie; Joseph Snider; Michael C Mozer; Richard J Krauzlis; Terrence J Sejnowski
Journal:  Proc Natl Acad Sci U S A       Date:  2013-06-10       Impact factor: 11.205

7.  Dual reward prediction components yield Pavlovian sign- and goal-tracking.

Authors:  Sivaramakrishnan Kaveri; Hiroyuki Nakahara
Journal:  PLoS One       Date:  2014-10-13       Impact factor: 3.240

Review 8.  The Dopamine Prediction Error: Contributions to Associative Models of Reward Learning.

Authors:  Helen M Nasser; Donna J Calu; Geoffrey Schoenbaum; Melissa J Sharpe
Journal:  Front Psychol       Date:  2017-02-22

Review 9.  Heads for learning, tails for memory: reward, reinforcement and a role of dopamine in determining behavioral relevance across multiple timescales.

Authors:  Mathieu Baudonnat; Anna Huber; Vincent David; Mark E Walton
Journal:  Front Neurosci       Date:  2013-10-11       Impact factor: 4.677

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.