| Literature DB >> 35494543 |
Fabio Muratore1,2, Fabio Ramos3,4, Greg Turk5, Wenhao Yu6, Michael Gienger2, Jan Peters1.
Abstract
The rise of deep learning has caused a paradigm shift in robotics research, favoring methods that require large amounts of data. Unfortunately, it is prohibitively expensive to generate such data sets on a physical platform. Therefore, state-of-the-art approaches learn in simulation where data generation is fast as well as inexpensive and subsequently transfer the knowledge to the real robot (sim-to-real). Despite becoming increasingly realistic, all simulators are by construction based on models, hence inevitably imperfect. This raises the question of how simulators can be modified to facilitate learning robot control policies and overcome the mismatch between simulation and reality, often called the "reality gap." We provide a comprehensive review of sim-to-real research for robotics, focusing on a technique named "domain randomization" which is a method for learning from randomized simulations.Entities:
Keywords: domain randomization; reality gap; reinforcement learning; robotics; sim-to-real; simulation; simulation optimization bias
Year: 2022 PMID: 35494543 PMCID: PMC9038844 DOI: 10.3389/frobt.2022.799893
Source DB: PubMed Journal: Front Robot AI ISSN: 2296-9144
FIGURE 1Examples of sim-to-real robot learning research using domain randomization: (left) Multiple simulation instances of robotic in-hand manipulation (OpenAI et al., 2020), (middle top) transformation to a canonical simulation (James et al., 2019), (middle bottom) synthetic 3D hallways generated for indoor drone flight (Sadeghi and Levine, 2017), (right top) ball-in-a-cup task solved with adaptive dynamics randomization (Muratore et al., 2021a), (right bottom) quadruped locomotion (Tan et al., 2018).
FIGURE 2Topological overview of the sim-to-real research and a selection of related fields.
FIGURE 3Topological overview of domain randomization methods.
FIGURE 4Conceptual illustration of static domain randomization.
FIGURE 5Conceptual illustration of adaptive domain randomization.
FIGURE 6Conceptual illustration of adversarial domain randomization.