Intrinsically motivated action-outcome learning and goal-based action recall: a system-level bio-constrained computational model.

Gianluca Baldassarre, Francesco Mannella, Vincenzo G Fiore, Peter Redgrave, Kevin Gurney, Marco Mirolli.

Abstract

Reinforcement (trial-and-error) learning in animals is driven by a multitude of processes. Most animals have evolved several sophisticated systems of 'extrinsic motivations' (EMs) that guide them to acquire behaviours allowing them to maintain their bodies, defend against threat, and reproduce. Animals have also evolved various systems of 'intrinsic motivations' (IMs) that allow them to acquire actions in the absence of extrinsic rewards. These actions are used later to pursue such rewards when they become available. Intrinsic motivations have been studied in Psychology for many decades and their biological substrates are now being elucidated by neuroscientists. In the last two decades, investigators in computational modelling, robotics and machine learning have proposed various mechanisms that capture certain aspects of IMs. However, we still lack models of IMs that attempt to integrate all key aspects of intrinsically motivated learning and behaviour while taking into account the relevant neurobiological constraints. This paper proposes a bio-constrained system-level model that contributes a major step towards this integration. The model focusses on three processes related to IMs and on the neural mechanisms underlying them: (a) the acquisition of action-outcome associations (internal models of the agent-environment interaction) driven by phasic dopamine signals caused by sudden, unexpected changes in the environment; (b) the transient focussing of visual gaze and actions on salient portions of the environment; (c) the subsequent recall of actions to pursue extrinsic rewards based on goal-directed reactivation of the representations of their outcomes. The tests of the model, including a series of selective lesions, show how the focussing processes lead to a faster learning of action-outcome associations, and how these associations can be recruited for accomplishing goal-directed behaviours. 
The model, together with the background knowledge reviewed in the paper, represents a framework that can be used to guide the design and interpretation of empirical experiments on IMs, and to computationally validate and further develop theories on them.

Entities: Chemical

Substances: Dopamine

Year: 2012 PMID: 23098753 DOI: 10.1016/j.neunet.2012.09.015

Source DB: PubMed Journal: Neural Netw ISSN: 0893-6080

Cited

19 in total

1. Basal Ganglia and Thalamic Contributions to Language Function: Insights from A Parallel Distributed Processing Perspective. (Review)

Authors: Stephen E Nadeau
Journal: Neuropsychol Rev Date: 2021-01-29 Impact factor: 7.444

2. Development of goal-directed action selection guided by intrinsic motivations: an experiment with children.

Authors: Fabrizio Taffoni; Eleonora Tamilia; Valentina Focaroli; Domenico Formica; Luca Ricci; Giovanni Di Pino; Gianluca Baldassarre; Marco Mirolli; Eugenio Guglielmelli; Flavio Keller
Journal: Exp Brain Res Date: 2014-04-02 Impact factor: 1.972

3. Intrinsic motivations drive learning of eye movements: an experiment with human adults.

Authors: Daniele Caligiore; Magda Mustile; Daniele Cipriani; Peter Redgrave; Jochen Triesch; Maria De Marsico; Gianluca Baldassarre
Journal: PLoS One Date: 2015-03-16 Impact factor: 3.240

4. Keep focussing: striatal dopamine multiple functions resolved in a single mechanism tested in a simulated humanoid robot.

Authors: Vincenzo G Fiore; Valerio Sperati; Francesco Mannella; Marco Mirolli; Kevin Gurney; Karl Friston; Raymond J Dolan; Gianluca Baldassarre
Journal: Front Psychol Date: 2014-02-21

5. An intrinsic value system for developing multiple invariant representations with incremental slowness learning.

Authors: Matthew Luciw; Varun Kompella; Sohrob Kazerounian; Juergen Schmidhuber
Journal: Front Neurorobot Date: 2013-05-30 Impact factor: 2.650

6. A biologically plausible embodied model of action discovery.

Authors: Rufino Bolado-Gomez; Kevin Gurney
Journal: Front Neurorobot Date: 2013-03-12 Impact factor: 2.650

7. Corticolimbic catecholamines in stress: a computational model of the appraisal of controllability.

Authors: Vincenzo G Fiore; Francesco Mannella; Marco Mirolli; Emanuele Claudio Latagliata; Alessandro Valzania; Simona Cabib; Raymond J Dolan; Stefano Puglisi-Allegra; Gianluca Baldassarre
Journal: Brain Struct Funct Date: 2014-02-28 Impact factor: 3.270

8. Evolutionarily conserved mechanisms for the selection and maintenance of behavioural activity.

Authors: Vincenzo G Fiore; Raymond J Dolan; Nicholas J Strausfeld; Frank Hirth
Journal: Philos Trans R Soc Lond B Biol Sci Date: 2015-12-19 Impact factor: 6.237

9. Nonhuman gamblers: lessons from rodents, primates, and robots. (Review)

Authors: Fabio Paglieri; Elsa Addessi; Francesca De Petrillo; Giovanni Laviola; Marco Mirolli; Domenico Parisi; Giancarlo Petrosino; Marialba Ventricelli; Francesca Zoratto; Walter Adriani
Journal: Front Behav Neurosci Date: 2014-02-11 Impact factor: 3.558

10. Emergent structured transition from variation to repetition in a biologically-plausible model of learning in basal ganglia. (Review)

Authors: Ashvin Shah; Kevin N Gurney
Journal: Front Psychol Date: 2014-02-11