Literature DB >> 18482830

Reinforcement learning of motor skills with policy gradients.

Jan Peters1, Stefan Schaal.   

Abstract

Autonomous learning is one of the hallmarks of human and animal behavior, and understanding the principles of learning will be crucial in order to achieve true autonomy in advanced machines like humanoid robots. In this paper, we examine learning of complex motor skills with human-like limbs. While supervised learning can offer useful tools for bootstrapping behavior, e.g., by learning from demonstration, it is only reinforcement learning that offers a general approach to the final trial-and-error improvement that is needed by each individual acquiring a skill. Neither neurobiological nor machine learning studies have, so far, offered compelling results on how reinforcement learning can be scaled to the high-dimensional continuous state and action spaces of humans or humanoids. Here, we combine two recent research developments on learning motor control in order to achieve this scaling. First, we interpret the idea of modular motor control by means of motor primitives as a suitable way to generate parameterized control policies for reinforcement learning. Second, we combine motor primitives with the theory of stochastic policy gradient learning, which currently seems to be the only feasible framework for reinforcement learning for humanoids. We evaluate different policy gradient methods with a focus on their applicability to parameterized motor primitives. We compare these algorithms in the context of motor primitive learning, and show that our most modern algorithm, the Episodic Natural Actor-Critic outperforms previous algorithms by at least an order of magnitude. We demonstrate the efficiency of this reinforcement learning method in the application of learning to hit a baseball with an anthropomorphic robot arm.

Entities:  

Mesh:

Year:  2008        PMID: 18482830     DOI: 10.1016/j.neunet.2008.02.003

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  28 in total

1.  Robotic learning of motion using demonstrations and statistical models for surgical simulation.

Authors:  Tao Yang; Chee Kong Chui; Jiang Liu; Weimin Huang; Yi Su; Stephen K Y Chang
Journal:  Int J Comput Assist Radiol Surg       Date:  2013-12-14       Impact factor: 2.924

2.  An information-theoretic approach to curiosity-driven reinforcement learning.

Authors:  Susanne Still; Doina Precup
Journal:  Theory Biosci       Date:  2012-07-12       Impact factor: 1.919

Review 3.  The Role of Variability in Motor Learning.

Authors:  Ashesh K Dhawale; Maurice A Smith; Bence P Ölveczky
Journal:  Annu Rev Neurosci       Date:  2017-05-10       Impact factor: 12.449

4.  Adaptive Regulation of Motor Variability.

Authors:  Ashesh K Dhawale; Yohsuke R Miyamoto; Maurice A Smith; Bence P Ölveczky
Journal:  Curr Biol       Date:  2019-10-17       Impact factor: 10.834

5.  From known to unknown: moving to unvisited locations in a novel sensorimotor map.

Authors:  Floris T van Vugt; David J Ostry
Journal:  Ann N Y Acad Sci       Date:  2018-03-03       Impact factor: 5.691

6.  Visuomotor coordination and cortical connectivity of modular motor learning.

Authors:  Pablo I Burgos; Juan J Mariman; Scott Makeig; Gonzalo Rivera-Lillo; Pedro E Maldonado
Journal:  Hum Brain Mapp       Date:  2018-05-15       Impact factor: 5.038

7.  Learning to reach by reinforcement learning using a receptive field based function approximation approach with continuous actions.

Authors:  Minija Tamosiunaite; Tamim Asfour; Florentin Wörgötter
Journal:  Biol Cybern       Date:  2009-02-20       Impact factor: 2.086

8.  Learned graphical models for probabilistic planning provide a new class of movement primitives.

Authors:  Elmar A Rückert; Gerhard Neumann; Marc Toussaint; Wolfgang Maass
Journal:  Front Comput Neurosci       Date:  2013-01-02       Impact factor: 2.380

9.  When money is not enough: awareness, success, and variability in motor learning.

Authors:  Harry Manley; Peter Dayan; Jörn Diedrichsen
Journal:  PLoS One       Date:  2014-01-28       Impact factor: 3.240

10.  Learned parametrized dynamic movement primitives with shared synergies for controlling robotic and musculoskeletal systems.

Authors:  Elmar Rückert; Andrea d'Avella
Journal:  Front Comput Neurosci       Date:  2013-10-17       Impact factor: 2.380

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.