Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Reinforcement learning of motor skills with policy gradients.

Literature DB >> 18482830

Reinforcement learning of motor skills with policy gradients.

Abstract

Autonomous learning is one of the hallmarks of human and animal behavior, and understanding the principles of learning will be crucial in order to achieve true autonomy in advanced machines like humanoid robots. In this paper, we examine learning of complex motor skills with human-like limbs. While supervised learning can offer useful tools for bootstrapping behavior, e.g., by learning from demonstration, it is only reinforcement learning that offers a general approach to the final trial-and-error improvement that is needed by each individual acquiring a skill. Neither neurobiological nor machine learning studies have, so far, offered compelling results on how reinforcement learning can be scaled to the high-dimensional continuous state and action spaces of humans or humanoids. Here, we combine two recent research developments on learning motor control in order to achieve this scaling. First, we interpret the idea of modular motor control by means of motor primitives as a suitable way to generate parameterized control policies for reinforcement learning. Second, we combine motor primitives with the theory of stochastic policy gradient learning, which currently seems to be the only feasible framework for reinforcement learning for humanoids. We evaluate different policy gradient methods with a focus on their applicability to parameterized motor primitives. We compare these algorithms in the context of motor primitive learning, and show that our most modern algorithm, the Episodic Natural Actor-Critic outperforms previous algorithms by at least an order of magnitude. We demonstrate the efficiency of this reinforcement learning method in the application of learning to hit a baseball with an anthropomorphic robot arm.

Entities: Species

Mesh：

Year: 2008 PMID： 18482830 DOI： 10.1016/j.neunet.2008.02.003

Source DB: PubMed Journal: Neural Netw ISSN： 0893-6080

Keyword Cloud
Cited

28 in total

1. Robotic learning of motion using demonstrations and statistical models for surgical simulation.

Authors: Tao Yang; Chee Kong Chui; Jiang Liu; Weimin Huang; Yi Su; Stephen K Y Chang
Journal: Int J Comput Assist Radiol Surg Date: 2013-12-14 Impact factor: 2.924

2. An information-theoretic approach to curiosity-driven reinforcement learning.

Authors: Susanne Still; Doina Precup
Journal: Theory Biosci Date: 2012-07-12 Impact factor: 1.919

Review 3. The Role of Variability in Motor Learning.

Authors: Ashesh K Dhawale; Maurice A Smith; Bence P Ölveczky
Journal: Annu Rev Neurosci Date: 2017-05-10 Impact factor: 12.449

4. Adaptive Regulation of Motor Variability.

Authors: Ashesh K Dhawale; Yohsuke R Miyamoto; Maurice A Smith; Bence P Ölveczky
Journal: Curr Biol Date: 2019-10-17 Impact factor: 10.834

5. From known to unknown: moving to unvisited locations in a novel sensorimotor map.

Authors: Floris T van Vugt; David J Ostry
Journal: Ann N Y Acad Sci Date: 2018-03-03 Impact factor: 5.691

6. Visuomotor coordination and cortical connectivity of modular motor learning.

Authors: Pablo I Burgos; Juan J Mariman; Scott Makeig; Gonzalo Rivera-Lillo; Pedro E Maldonado
Journal: Hum Brain Mapp Date: 2018-05-15 Impact factor: 5.038

7. Learning to reach by reinforcement learning using a receptive field based function approximation approach with continuous actions.

Authors: Minija Tamosiunaite; Tamim Asfour; Florentin Wörgötter
Journal: Biol Cybern Date: 2009-02-20 Impact factor: 2.086

8. Learned graphical models for probabilistic planning provide a new class of movement primitives.

Authors: Elmar A Rückert; Gerhard Neumann; Marc Toussaint; Wolfgang Maass
Journal: Front Comput Neurosci Date: 2013-01-02 Impact factor: 2.380

9. When money is not enough: awareness, success, and variability in motor learning.

Authors: Harry Manley; Peter Dayan; Jörn Diedrichsen
Journal: PLoS One Date: 2014-01-28 Impact factor: 3.240

10. Learned parametrized dynamic movement primitives with shared synergies for controlling robotic and musculoskeletal systems.

Authors: Elmar Rückert; Andrea d'Avella
Journal: Front Comput Neurosci Date: 2013-10-17 Impact factor: 2.380