Literature DB >> 33501210

Skill Learning by Autonomous Robotic Playing Using Active Learning and Exploratory Behavior Composition.

Simon Hangl1, Vedran Dunjko2, Hans J Briegel3, Justus Piater1.   

Abstract

We consider the problem of autonomous acquisition of manipulation skills where problem-solving strategies are initially available only for a narrow range of situations. We propose to extend the range of solvable situations by autonomous play with the object. By applying previously-trained skills and behaviors, the robot learns how to prepare situations for which a successful strategy is already known. The information gathered during autonomous play is additionally used to train an environment model. This model is exploited for active learning and the generation of novel preparatory behaviors compositions. We apply our approach to a wide range of different manipulation tasks, e.g., book grasping, grasping of objects of different sizes by selecting different grasping strategies, placement on shelves, and tower disassembly. We show that the composite behavior generation mechanism enables the robot to solve previously-unsolvable tasks, e.g., tower disassembly. We use success statistics gained during real-world experiments to simulate the convergence behavior of our system. Simulation experiments show that the learning speed can be improved by around 30% by using active learning.
Copyright © 2020 Hangl, Dunjko, Briegel and Piater.

Entities:  

Keywords:  active learning; autonomous robotics; behavior composition; hierarchical models; reinforcement learning; robotic manipulation; skill learning

Year:  2020        PMID: 33501210      PMCID: PMC7806109          DOI: 10.3389/frobt.2020.00042

Source DB:  PubMed          Journal:  Front Robot AI        ISSN: 2296-9144


  14 in total

1.  Predictability, complexity, and learning.

Authors:  W Bialek; I Nemenman; N Tishby
Journal:  Neural Comput       Date:  2001-11       Impact factor: 2.026

2.  Neural Network Exploration Using Optimal Experiment Design.

Authors:  David A. Cohn
Journal:  Neural Netw       Date:  1996-08

3.  Path planning versus cue responding: a bio-inspired model of switching between navigation strategies.

Authors:  Laurent Dollé; Denis Sheynikhovich; Benoît Girard; Ricardo Chavarriaga; Agnès Guillot
Journal:  Biol Cybern       Date:  2010-07-09       Impact factor: 2.086

Review 4.  Habits, action sequences and reinforcement learning.

Authors:  Amir Dezfouli; Bernard W Balleine
Journal:  Eur J Neurosci       Date:  2012-04       Impact factor: 3.386

5.  On creative machines and the physical origins of freedom.

Authors:  Hans J Briegel
Journal:  Sci Rep       Date:  2012-07-20       Impact factor: 4.379

6.  Projective simulation for artificial intelligence.

Authors:  Hans J Briegel; Gemma De las Cuevas
Journal:  Sci Rep       Date:  2012-05-15       Impact factor: 4.379

7.  Speed/accuracy trade-off between the habitual and the goal-directed processes.

Authors:  Mehdi Keramati; Amir Dezfouli; Payam Piray
Journal:  PLoS Comput Biol       Date:  2011-05-26       Impact factor: 4.475

8.  On the convergence of projective-simulation-based reinforcement learning in Markov decision processes.

Authors:  W L Boyajian; J Clausen; L M Trenkwalder; V Dunjko; H J Briegel
Journal:  Quantum Mach Intell       Date:  2020-11-05

9.  Information driven self-organization of complex robotic behaviors.

Authors:  Georg Martius; Ralf Der; Nihat Ay
Journal:  PLoS One       Date:  2013-05-27       Impact factor: 3.240

10.  Adaptive quantum computation in changing environments using projective simulation.

Authors:  M Tiersch; E J Ganahl; H J Briegel
Journal:  Sci Rep       Date:  2015-08-11       Impact factor: 4.379

View more
  1 in total

1.  Honeybee communication during collective defence is shaped by predation.

Authors:  Andrea López-Incera; Morgane Nouvian; Katja Ried; Thomas Müller; Hans J Briegel
Journal:  BMC Biol       Date:  2021-05-25       Impact factor: 7.431

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.