Literature DB >> 21629583

Computational models of reinforcement learning: the role of dopamine as a reward signal.

R D Samson, M J Frank, Jean-Marc Fellous.   

Abstract

Reinforcement learning is ubiquitous. Unlike other forms of learning, it involves the processing of fast yet content-poor feedback information to correct assumptions about the nature of a task or of a set of stimuli. This feedback information is often delivered as generic rewards or punishments, and has little to do with the stimulus features to be learned. How can such low-content feedback lead to such an efficient learning paradigm? Through a review of existing neuro-computational models of reinforcement learning, we suggest that the efficiency of this type of learning resides in the dynamic and synergistic cooperation of brain systems that use different levels of computations. The implementation of reward signals at the synaptic, cellular, network and system levels give the organism the necessary robustness, adaptability and processing speed required for evolutionary and behavioral success.

Entities:  

Keywords:  Dopamine; Reinforcement learning; Reward; Temporal difference

Year:  2010        PMID: 21629583      PMCID: PMC2866366          DOI: 10.1007/s11571-010-9109-x

Source DB:  PubMed          Journal:  Cogn Neurodyn        ISSN: 1871-4080            Impact factor:   5.082


  160 in total

1.  Multiple model-based reinforcement learning.

Authors:  Kenji Doya; Kazuyuki Samejima; Ken-ichi Katagiri; Mitsuo Kawato
Journal:  Neural Comput       Date:  2002-06       Impact factor: 2.026

2.  Postsynaptic modulation of AMPA- and NMDA-receptor currents by Group III metabotropic glutamate receptors in rat nucleus accumbens.

Authors:  Stefano Taverna; Cyriel M A Pennartz
Journal:  Brain Res       Date:  2003-06-20       Impact factor: 3.252

Review 3.  Neuroeconomics: the consilience of brain and decision.

Authors:  Paul W Glimcher; Aldo Rustichini
Journal:  Science       Date:  2004-10-15       Impact factor: 47.728

4.  Inverted-U dopamine D1 receptor actions on prefrontal neurons engaged in working memory.

Authors:  Susheel Vijayraghavan; Min Wang; Shari G Birnbaum; Graham V Williams; Amy F T Arnsten
Journal:  Nat Neurosci       Date:  2007-02-04       Impact factor: 24.884

5.  A spiking neural network model of an actor-critic learning agent.

Authors:  Wiebke Potjans; Abigail Morrison; Markus Diesmann
Journal:  Neural Comput       Date:  2009-02       Impact factor: 2.026

Review 6.  Review. The incentive sensitization theory of addiction: some current issues.

Authors:  Terry E Robinson; Kent C Berridge
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2008-10-12       Impact factor: 6.237

Review 7.  A neural substrate of prediction and reward.

Authors:  W Schultz; P Dayan; P R Montague
Journal:  Science       Date:  1997-03-14       Impact factor: 47.728

8.  Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards.

Authors:  Matthew R Roesch; Donna J Calu; Geoffrey Schoenbaum
Journal:  Nat Neurosci       Date:  2007-11-18       Impact factor: 24.884

9.  Dopamine neurons of the monkey midbrain: contingencies of responses to active touch during self-initiated arm movements.

Authors:  R Romo; W Schultz
Journal:  J Neurophysiol       Date:  1990-03       Impact factor: 2.714

Review 10.  Brain dopamine and reward.

Authors:  R A Wise; P P Rompre
Journal:  Annu Rev Psychol       Date:  1989       Impact factor: 24.137

View more
  26 in total

Review 1.  The role of efference copy in striatal learning.

Authors:  Michale S Fee
Journal:  Curr Opin Neurobiol       Date:  2014-02-21       Impact factor: 6.627

2.  Offline reactivation of experience-dependent neuronal firing patterns in the rat ventral tegmental area.

Authors:  José L Valdés; Bruce L McNaughton; Jean-Marc Fellous
Journal:  J Neurophysiol       Date:  2015-06-24       Impact factor: 2.714

3.  Route searching based on neural networks and heuristic reinforcement learning.

Authors:  Fengyun Zhang; Shukai Duan; Lidan Wang
Journal:  Cogn Neurodyn       Date:  2017-02-09       Impact factor: 5.082

Review 4.  Domain generality versus modality specificity: the paradox of statistical learning.

Authors:  Ram Frost; Blair C Armstrong; Noam Siegelman; Morten H Christiansen
Journal:  Trends Cogn Sci       Date:  2015-01-24       Impact factor: 20.229

5.  Cannabinoids and value-based decision making: implications for neurodegenerative disorders.

Authors:  Angela M Lee; Erik B Oleson; Leontien Diergaarde; Joseph F Cheer; Tommy Pattij
Journal:  Basal Ganglia       Date:  2012-07-28

6.  Dynamic interplay among homeostatic, hedonic, and cognitive feedback circuits regulating body weight.

Authors:  Kevin D Hall; Ross A Hammond; Hazhir Rahmandad
Journal:  Am J Public Health       Date:  2014-05-15       Impact factor: 9.308

7.  Separate neural representations of prediction error valence and surprise: Evidence from an fMRI meta-analysis.

Authors:  Elsa Fouragnan; Chris Retzler; Marios G Philiastides
Journal:  Hum Brain Mapp       Date:  2018-03-25       Impact factor: 5.038

8.  Learning-dependent plasticity in human auditory cortex during appetitive operant conditioning.

Authors:  Sebastian Puschmann; André Brechmann; Christiane M Thiel
Journal:  Hum Brain Mapp       Date:  2012-05-19       Impact factor: 5.038

9.  A possible correlation between the basal ganglia motor function and the inverse kinematics calculation.

Authors:  Armin Salimi-Badr; Mohammad Mehdi Ebadzadeh; Christian Darlot
Journal:  J Comput Neurosci       Date:  2017-10-23       Impact factor: 1.621

10.  Deep brain stimulation amplitude alters posture shift velocity in Parkinson's disease.

Authors:  Narayanan Krishnamurthi; Stefani Mulligan; Padma Mahant; Johan Samanta; James J Abbas
Journal:  Cogn Neurodyn       Date:  2012-04-12       Impact factor: 5.082

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.