Meta-learning in reinforcement learning.

Abstract

Meta-parameters in reinforcement learning should be tuned to the environmental dynamics and the animal's performance. Here, we propose a biologically plausible meta-reinforcement learning algorithm for tuning these meta-parameters in a dynamic, adaptive manner. We tested our algorithm both in a simulation of a Markov decision task and in a non-linear control task. Our results show that the algorithm robustly finds appropriate meta-parameter values and controls the meta-parameter time course in both static and dynamic environments. We suggest that the phasic and tonic components of dopamine neuron firing can encode the signal required for meta-learning of reinforcement learning.
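
As a rough illustration of what tuning a meta-parameter online can look like, the following Python sketch adapts the softmax inverse temperature of a simple value-learning agent while it learns. It is not the algorithm from the paper: the two-armed bandit task, the noise-perturbation update rule, and every name and constant (`run_bandit`, `reward_prob`, `meta_rate`, `noise_sd`, the clamp range) are assumptions chosen only for illustration.

```python
import math
import random


def softmax_choice(q_values, beta, rng):
    """Sample an action with probability proportional to exp(beta * Q)."""
    top = max(q_values)  # subtract the max for numerical stability
    weights = [math.exp(beta * (q - top)) for q in q_values]
    return rng.choices(range(len(q_values)), weights=weights)[0]


def run_bandit(steps=5000, seed=0):
    """Learn action values while adapting the inverse temperature beta."""
    rng = random.Random(seed)
    reward_prob = [0.2, 0.8]   # hypothetical two-armed bandit
    q = [0.0, 0.0]             # action-value estimates
    alpha = 0.1                # learning rate, held fixed in this sketch
    log_beta = 0.0             # tuned meta-parameter: beta = exp(log_beta)
    long_avg = 0.0             # slow running average of reward (baseline)
    meta_rate, noise_sd = 0.05, 0.3  # assumed constants

    for _ in range(steps):
        # Perturb the meta-parameter, act with the perturbed value.
        noise = rng.gauss(0.0, noise_sd)
        beta = math.exp(log_beta + noise)
        action = softmax_choice(q, beta, rng)
        reward = 1.0 if rng.random() < reward_prob[action] else 0.0

        # Ordinary value update with the fixed learning rate.
        q[action] += alpha * (reward - q[action])

        # Meta-update: keep perturbations that coincided with
        # better-than-baseline reward, then refresh the baseline.
        log_beta += meta_rate * (reward - long_avg) * noise
        log_beta = max(-3.0, min(3.0, log_beta))  # keep beta in a sane range
        long_avg += 0.01 * (reward - long_avg)

    return math.exp(log_beta), q, long_avg


if __name__ == "__main__":
    beta, q, avg = run_bandit()
    print(f"beta={beta:.2f} q={q} average reward={avg:.2f}")
```

The design choice here is to correlate a random perturbation of the meta-parameter with the reward relative to a slowly updated baseline, which is one common way to adjust a parameter without an explicit gradient. The learning rate and any discount factor could in principle be treated the same way.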

Year: 2003 PMID: 12576101 DOI: 10.1016/s0893-6080(02)00228-9

Source DB: PubMed Journal: Neural Netw ISSN: 0893-6080

Cited

21 in total

1. A neural circuit model of flexible sensorimotor mapping: learning and forgetting on multiple timescales.

2. Metaplasticity as a Neural Substrate for Adaptive Learning and Choice under Uncertainty.

3. Catecholaminergic modulation of meta-learning.

4. A possible correlation between the basal ganglia motor function and the inverse kinematics calculation.

5. Cortical mechanisms for reinforcement learning in competitive games.

6. Dual adaptation supports a parallel architecture of motor memory.

7. The New Robotics-towards human-centered machines.

8. Neural mechanism for stochastic behaviour during a competitive game.

9. Use it and improve it or lose it: interactions between arm function and use in humans post-stroke.

10. An imperfect dopaminergic error signal can drive temporal-difference learning.