Phasic dopamine as a prediction error of intrinsic and extrinsic reinforcements driving both action acquisition and reward maximization: a simulated robotic study.

Marco Mirolli, Vieri G Santucci, Gianluca Baldassarre.

Abstract

An important issue of recent neuroscientific research is to understand the functional role of the phasic release of dopamine in the striatum, and in particular its relation to reinforcement learning. The literature is split between two alternative hypotheses: one considers phasic dopamine as a reward prediction error similar to the computational TD-error, whose function is to guide an animal to maximize future rewards; the other holds that phasic dopamine is a sensory prediction error signal that lets the animal discover and acquire novel actions. In this paper we propose an original hypothesis that integrates these two contrasting positions: according to our view phasic dopamine represents a TD-like reinforcement prediction error learning signal determined by both unexpected changes in the environment (temporary, intrinsic reinforcements) and biological rewards (permanent, extrinsic reinforcements). Accordingly, dopamine plays the functional role of driving both the discovery and acquisition of novel actions and the maximization of future rewards. To validate our hypothesis we perform a series of experiments with a simulated robotic system that has to learn different skills in order to get rewards. We compare different versions of the system in which we vary the composition of the learning signal. The results show that only the system reinforced by both extrinsic and intrinsic reinforcements is able to reach high performance in sufficiently complex conditions.
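The idea of a TD-like error driven by both kinds of reinforcement can be illustrated with a toy sketch. This is not the model used in the paper: the names `td_error`, `EventPredictor`, and `simulate`, the single-state setup, and all parameter values are assumptions chosen for illustration. The intrinsic term is modeled here as the unpredicted part of a repeated environmental event, so it fades as a simple predictor learns the event, while the extrinsic reward stays constant.

```python
"""Toy sketch: TD-like error driven by extrinsic plus intrinsic reinforcement.

Illustrative assumption only; this is not the model used in the paper.
"""


def td_error(reinforcement, value, next_value, gamma=0.9):
    """Standard TD error: r + gamma * V(s') - V(s)."""
    return reinforcement + gamma * next_value - value


class EventPredictor:
    """Hypothetical predictor of an environmental change.

    The intrinsic reinforcement is the unpredicted part of the event, so it
    fades as the predictor learns (a "temporary" reinforcement).
    """

    def __init__(self, learning_rate=0.2):
        self.prediction = 0.0
        self.learning_rate = learning_rate

    def intrinsic_reinforcement(self, event_occurred):
        observed = 1.0 if event_occurred else 0.0
        surprise = max(0.0, observed - self.prediction)
        # Move the prediction toward what was observed.
        self.prediction += self.learning_rate * (observed - self.prediction)
        return surprise


def simulate(n_trials=20, extrinsic_reward=1.0, alpha=0.1):
    """Repeat the same rewarded event; return intrinsic signals and TD errors."""
    predictor = EventPredictor()
    value = 0.0  # value estimate of the single (terminal) state
    intrinsic, deltas = [], []
    for _ in range(n_trials):
        r_int = predictor.intrinsic_reinforcement(event_occurred=True)
        # The learning signal combines both reinforcement sources.
        delta = td_error(extrinsic_reward + r_int, value, next_value=0.0)
        value += alpha * delta
        intrinsic.append(r_int)
        deltas.append(delta)
    return intrinsic, deltas


if __name__ == "__main__":
    intrinsic, deltas = simulate()
    print("intrinsic, first vs last:", intrinsic[0], round(intrinsic[-1], 4))
    print("TD error, first vs last:", deltas[0], round(deltas[-1], 4))
```

In this sketch the intrinsic contribution shrinks geometrically across trials, so late in learning the error signal is determined almost entirely by the extrinsic reward.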

Substances: Dopamine

Year: 2013 PMID: 23353115 DOI: 10.1016/j.neunet.2012.12.012

Source DB: PubMed Journal: Neural Netw ISSN: 0893-6080

Cited

11 in total

1. Development of goal-directed action selection guided by intrinsic motivations: an experiment with children.

Authors: Fabrizio Taffoni; Eleonora Tamilia; Valentina Focaroli; Domenico Formica; Luca Ricci; Giovanni Di Pino; Gianluca Baldassarre; Marco Mirolli; Eugenio Guglielmelli; Flavio Keller
Journal: Exp Brain Res Date: 2014-04-02 Impact factor: 1.972

2. Intrinsic motivations drive learning of eye movements: an experiment with human adults.

Authors: Daniele Caligiore; Magda Mustile; Daniele Cipriani; Peter Redgrave; Jochen Triesch; Maria De Marsico; Gianluca Baldassarre
Journal: PLoS One Date: 2015-03-16 Impact factor: 3.240

3. Keep focussing: striatal dopamine multiple functions resolved in a single mechanism tested in a simulated humanoid robot.

Authors: Vincenzo G Fiore; Valerio Sperati; Francesco Mannella; Marco Mirolli; Kevin Gurney; Karl Friston; Raymond J Dolan; Gianluca Baldassarre
Journal: Front Psychol Date: 2014-02-21

4. The nucleus accumbens as a nexus between values and goals in goal-directed behavior: a review and a new hypothesis.

Authors: Francesco Mannella; Kevin Gurney; Gianluca Baldassarre
Journal: Front Behav Neurosci Date: 2013-10-23 Impact factor: 3.558

5. Modeling effects of intrinsic and extrinsic rewards on the competition between striatal learning systems.

Authors: Joschka Boedecker; Thomas Lampe; Martin Riedmiller
Journal: Front Psychol Date: 2013-10-16

6. Which is the best intrinsic motivation signal for learning multiple skills?

Authors: Vieri G Santucci; Gianluca Baldassarre; Marco Mirolli
Journal: Front Neurorobot Date: 2013-11-12 Impact factor: 2.650

7. Nonhuman gamblers: lessons from rodents, primates, and robots. (Review)

Authors: Fabio Paglieri; Elsa Addessi; Francesca De Petrillo; Giovanni Laviola; Marco Mirolli; Domenico Parisi; Giancarlo Petrosino; Marialba Ventricelli; Francesca Zoratto; Walter Adriani
Journal: Front Behav Neurosci Date: 2014-02-11 Impact factor: 3.558

8. Emergent structured transition from variation to repetition in a biologically-plausible model of learning in basal ganglia. (Review)

Authors: Ashvin Shah; Kevin N Gurney
Journal: Front Psychol Date: 2014-02-11

9. Novelty or surprise?

Authors: Andrew Barto; Marco Mirolli; Gianluca Baldassarre
Journal: Front Psychol Date: 2013-12-11

10. Intrinsic motivations and open-ended development in animals, humans, and robots: an overview.

Authors: Gianluca Baldassarre; Tom Stafford; Marco Mirolli; Peter Redgrave; Richard M Ryan; Andrew Barto
Journal: Front Psychol Date: 2014-09-09