Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Reinforcement learning for a biped robot based on a CPG-actor-critic method.

Literature DB >> 17412559

Reinforcement learning for a biped robot based on a CPG-actor-critic method.

Yutaka Nakamura¹, Takeshi Mori, Masa-aki Sato, Shin Ishii.

Abstract

Animals' rhythmic movements, such as locomotion, are considered to be controlled by neural circuits called central pattern generators (CPGs), which generate oscillatory signals. Motivated by this biological mechanism, studies have been conducted on the rhythmic movements controlled by CPG. As an autonomous learning framework for a CPG controller, we propose in this article a reinforcement learning method we call the "CPG-actor-critic" method. This method introduces a new architecture to the actor, and its training is roughly based on a stochastic policy gradient algorithm presented recently. We apply this method to an automatic acquisition problem of control for a biped robot. Computer simulations show that training of the CPG can be successfully performed by our method, thus allowing the biped robot to not only walk stably but also adapt to environmental changes.

Mesh：

Year: 2007 PMID： 17412559 DOI： 10.1016/j.neunet.2007.01.002

Source DB: PubMed Journal: Neural Netw ISSN： 0893-6080

Keyword Cloud
Cited

5 in total

1. Exploring Behaviors of Caterpillar-Like Soft Robots with a Central Pattern Generator-Based Controller and Reinforcement Learning.

Authors: Matthew Ishige; Takuya Umedachi; Tadahiro Taniguchi; Yoshihiro Kawahara
Journal: Soft Robot Date: 2019-05-20 Impact factor: 8.071

2. A neurorobotic platform to test the influence of neuromodulatory signaling on anxious and curious behavior.

Authors: Jeffrey L Krichmar
Journal: Front Neurorobot Date: 2013-02-05 Impact factor: 2.650

3. A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives.

Authors: Cai Li; Robert Lowe; Tom Ziemke
Journal: Front Neurorobot Date: 2014-10-02 Impact factor: 2.650

4. Fast Dynamical Coupling Enhances Frequency Adaptation of Oscillators for Robotic Locomotion Control.

Authors: Timo Nachstedt; Christian Tetzlaff; Poramate Manoonpong
Journal: Front Neurorobot Date: 2017-03-21 Impact factor: 2.650

5. Humanoids Learning to Walk: A Natural CPG-Actor-Critic Architecture.

Authors: Cai Li; Robert Lowe; Tom Ziemke
Journal: Front Neurorobot Date: 2013-04-08 Impact factor: 2.650

5 in total