
Adaptive Prior Selection for Repertoire-Based Online Adaptation in Robotics.

Rituraj Kaushik, Pierre Desreumaux, Jean-Baptiste Mouret.

Abstract

Repertoire-based learning is a data-efficient adaptation approach based on a two-step process in which (1) a large and diverse set of policies is learned in simulation, and (2) a planning or learning algorithm chooses the most appropriate policies according to the current situation (e.g., a damaged robot or a new object). In this paper, we relax the assumption of previous works that a single repertoire is enough for adaptation. Instead, we generate repertoires for many different situations (e.g., with a missing leg or on different floors) and let our algorithm select the most useful prior. Our main contribution is an algorithm, APROL (Adaptive Prior selection for Repertoire-based Online Learning), that plans the next action by incorporating these priors when the robot has no information about the current situation. We evaluate APROL on two simulated tasks: (1) pushing unknown objects of various shapes and sizes with a robotic arm and (2) a goal-reaching task with a damaged hexapod robot. We compare with "Reset-free Trial and Error" (RTE) and various single-repertoire-based baselines. The results show that APROL solves both tasks in less interaction time than the baselines. Additionally, we demonstrate APROL on a real, damaged hexapod that quickly learns to pick compensatory policies to reach a goal while avoiding obstacles in its path.
Copyright © 2020 Kaushik, Desreumaux and Mouret.
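
The prior-selection idea in the abstract can be illustrated with a minimal sketch. The abstract does not state how APROL scores priors, so the mean-squared-error criterion below is a stand-in assumption, not the authors' method; the names `Repertoire`, `prediction_error`, and `select_prior`, and all numeric values, are hypothetical.

```python
from typing import Dict, List, Tuple

Outcome = Tuple[float, float]
# Hypothetical repertoire: policy id -> outcome predicted in simulation.
Repertoire = Dict[int, Outcome]
Observation = Tuple[int, Outcome]  # (policy id executed, outcome observed)


def prediction_error(repertoire: Repertoire, observations: List[Observation]) -> float:
    """Mean squared distance between predicted and observed outcomes.

    Assumption: squared error stands in for whatever score APROL really uses.
    """
    if not observations:
        return 0.0  # no data yet: every prior looks equally plausible
    total = 0.0
    for policy_id, observed in observations:
        predicted = repertoire[policy_id]
        total += sum((p - o) ** 2 for p, o in zip(predicted, observed))
    return total / len(observations)


def select_prior(repertoires: Dict[str, Repertoire], observations: List[Observation]) -> str:
    """Return the name of the repertoire that best explains the observations."""
    return min(repertoires, key=lambda name: prediction_error(repertoires[name], observations))


# Two made-up priors, one per simulated situation.
repertoires = {
    "intact": {0: (1.0, 0.0), 1: (0.0, 1.0)},
    "missing_leg": {0: (0.5, 0.3), 1: (0.0, 0.6)},
}
# Made-up outcomes observed on the robot after running policies 0 and 1.
observations = [(0, (0.55, 0.25)), (1, (0.05, 0.6))]
best = select_prior(repertoires, observations)
```

With these made-up numbers, `best` is `"missing_leg"`, because that repertoire's predictions lie closest to the observed outcomes. A planner would then pick its next policy from the selected repertoire and repeat the comparison as new observations arrive.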

Keywords:  data-efficient robot learning; evolutionary robotics; fault tolerance in robotics; model-based learning; repertoire-based robot learning

Year:  2020        PMID: 33501166      PMCID: PMC7805922          DOI: 10.3389/frobt.2019.00151

Source DB:  PubMed          Journal:  Front Robot AI        ISSN: 2296-9144


