Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Parallel Online Temporal Difference Learning for Motor Control.

Literature DB >> 26111402

Parallel Online Temporal Difference Learning for Motor Control.

Abstract

Temporal difference (TD) learning, a key concept in reinforcement learning, is a popular method for solving simulated control problems. However, in real systems, this method is often avoided in favor of policy search methods because of its long learning time. But policy search suffers from its own drawbacks, such as the necessity of informed policy parameterization and initialization. In this paper, we show that TD learning can work effectively in real robotic systems as well, using parallel model learning and planning. Using locally weighted linear regression and trajectory sampled planning with 14 concurrent threads, we can achieve a speedup of almost two orders of magnitude over regular TD control on simulated control benchmarks. For a real-world pendulum swing-up task and a two-link manipulator movement task, we report a speedup of 20× to 60× , with a real-time learning speed of less than half a minute. The results are competitive with state-of-the-art policy search.

Year: 2015 PMID： 26111402 DOI： 10.1109/TNNLS.2015.2442233

Source DB: PubMed Journal: IEEE Trans Neural Netw Learn Syst ISSN： 2162-237X Impact factor: 10.451

Keyword Cloud
Cited

3 in total

1. Reactive Reinforcement Learning in Asynchronous Environments.

Authors: Jaden B Travnik; Kory W Mathewson; Richard S Sutton; Patrick M Pilarski
Journal: Front Robot AI Date: 2018-06-26

2. Model Learning and Knowledge Sharing for Cooperative Multiagent Systems in Stochastic Environment.

Authors: Wei-Cheng Jiang; Vignesh Narayanan; Jr-Shin Li
Journal: IEEE Trans Cybern Date: 2021-12-22 Impact factor: 11.448

3. Reinforcement Learning (RL)-Based Energy Efficient Resource Allocation for Energy Harvesting-Powered Wireless Body Area Network.

Authors: Yi-Han Xu; Jing-Wei Xie; Yang-Gang Zhang; Min Hua; Wen Zhou
Journal: Sensors (Basel) Date: 2019-12-19 Impact factor: 3.576

3 in total