Literature DB >> 33500989

Stage-Wise Learning of Reaching Using Little Prior Knowledge.

François de La Bourdonnaye1, Céline Teulière1, Jochen Triesch2, Thierry Chateau1.   

Abstract

In some manipulation robotics environments, because of the difficulty of precisely modeling dynamics and computing features which describe well the variety of scene appearances, hand-programming a robot behavior is often intractable. Deep reinforcement learning methods partially alleviate this problem in that they can dispense with hand-crafted features for the state representation and do not need pre-computed dynamics. However, they often use prior information in the task definition in the form of shaping rewards which guide the robot toward goal state areas but require engineering or human supervision and can lead to sub-optimal behavior. In this work we consider a complex robot reaching task with a large range of initial object positions and initial arm positions and propose a new learning approach with minimal supervision. Inspired by developmental robotics, our method consists of a weakly-supervised stage-wise procedure of three tasks. First, the robot learns to fixate the object with a 2-camera system. Second, it learns hand-eye coordination by learning to fixate its end-effector. Third, using the knowledge acquired in the previous steps, it learns to reach the object at different positions and from a large set of initial robot joint angles. Experiments in a simulated environment show that our stage-wise framework yields similar reaching performances, compared with a supervised setting without using kinematic models, hand-crafted features, calibration parameters or supervised visual modules.
Copyright © 2018 de La Bourdonnaye, Teulière, Triesch and Chateau.

Entities:  

Keywords:  deep reinforcement learning; hierarchical learning; manipulation robotics; stage-wise learning; weakly-supervised

Year:  2018        PMID: 33500989      PMCID: PMC7806066          DOI: 10.3389/frobt.2018.00110

Source DB:  PubMed          Journal:  Front Robot AI        ISSN: 2296-9144


  5 in total

1.  Reach plans in eye-centered coordinates.

Authors:  A P Batista; C A Buneo; L H Snyder; R A Andersen
Journal:  Science       Date:  1999-07-09       Impact factor: 47.728

2.  The coupling between point-of-gaze and ball movements in three-ball cascade juggling: the effects of expertise, pattern and tempo.

Authors:  Raoul Huys; Peter J Beek
Journal:  J Sports Sci       Date:  2002-03       Impact factor: 3.337

3.  Learning visuomotor transformations for gaze-control and grasping.

Authors:  Heiko Hoffmann; Wolfram Schenck; Ralf Möller
Journal:  Biol Cybern       Date:  2005-07-18       Impact factor: 2.086

4.  Magnetic misreaching.

Authors:  D P Carey; R J Coleman; S Della Sala
Journal:  Cortex       Date:  1997-12       Impact factor: 4.027

5.  Human-level control through deep reinforcement learning.

Authors:  Volodymyr Mnih; Koray Kavukcuoglu; David Silver; Andrei A Rusu; Joel Veness; Marc G Bellemare; Alex Graves; Martin Riedmiller; Andreas K Fidjeland; Georg Ostrovski; Stig Petersen; Charles Beattie; Amir Sadik; Ioannis Antonoglou; Helen King; Dharshan Kumaran; Daan Wierstra; Shane Legg; Demis Hassabis
Journal:  Nature       Date:  2015-02-26       Impact factor: 49.962

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.