Literature DB >> 12576101

Meta-learning in reinforcement learning.

Nicolas Schweighofer1, Kenji Doya.   

Abstract

Meta-parameters in reinforcement learning should be tuned to the environmental dynamics and the animal performance. Here, we propose a biologically plausible meta-reinforcement learning algorithm for tuning these meta-parameters in a dynamic, adaptive manner. We tested our algorithm in both a simulation of a Markov decision task and in a non-linear control task. Our results show that the algorithm robustly finds appropriate meta-parameter values, and controls the meta-parameter time course, in both static and dynamic environments. We suggest that the phasic and tonic components of dopamine neuron firing can encode the signal required for meta-learning of reinforcement learning.

Entities:  

Mesh:

Year:  2003        PMID: 12576101     DOI: 10.1016/s0893-6080(02)00228-9

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  21 in total

1.  A neural circuit model of flexible sensorimotor mapping: learning and forgetting on multiple timescales.

Authors:  Stefano Fusi; Wael F Asaad; Earl K Miller; Xiao-Jing Wang
Journal:  Neuron       Date:  2007-04-19       Impact factor: 17.173

2.  Metaplasticity as a Neural Substrate for Adaptive Learning and Choice under Uncertainty.

Authors:  Shiva Farashahi; Christopher H Donahue; Peyman Khorsand; Hyojung Seo; Daeyeol Lee; Alireza Soltani
Journal:  Neuron       Date:  2017-04-19       Impact factor: 17.173

3.  Catecholaminergic modulation of meta-learning.

Authors:  Hanneke Em den Ouden; Roshan Cools; Jennifer L Cook; Jennifer C Swart; Monja I Froböse; Andreea O Diaconescu; Dirk Em Geurts
Journal:  Elife       Date:  2019-12-18       Impact factor: 8.140

4.  A possible correlation between the basal ganglia motor function and the inverse kinematics calculation.

Authors:  Armin Salimi-Badr; Mohammad Mehdi Ebadzadeh; Christian Darlot
Journal:  J Comput Neurosci       Date:  2017-10-23       Impact factor: 1.621

5.  Cortical mechanisms for reinforcement learning in competitive games.

Authors:  Hyojung Seo; Daeyeol Lee
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2008-12-12       Impact factor: 6.237

6.  Dual adaptation supports a parallel architecture of motor memory.

Authors:  Jeong-Yoon Lee; Nicolas Schweighofer
Journal:  J Neurosci       Date:  2009-08-19       Impact factor: 6.167

7.  The New Robotics-towards human-centered machines.

Authors:  Stefan Schaal
Journal:  HFSP J       Date:  2007-07-16

8.  Neural mechanism for stochastic behaviour during a competitive game.

Authors:  Alireza Soltani; Daeyeol Lee; Xiao-Jing Wang
Journal:  Neural Netw       Date:  2006-10

9.  Use it and improve it or lose it: interactions between arm function and use in humans post-stroke.

Authors:  Yukikazu Hidaka; Cheol E Han; Steven L Wolf; Carolee J Winstein; Nicolas Schweighofer
Journal:  PLoS Comput Biol       Date:  2012-02-16       Impact factor: 4.475

10.  An imperfect dopaminergic error signal can drive temporal-difference learning.

Authors:  Wiebke Potjans; Markus Diesmann; Abigail Morrison
Journal:  PLoS Comput Biol       Date:  2011-05-12       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.