Literature DB >> 34283119

General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles.

Chen-Huan Pi1, Yi-Wei Dai1, Kai-Chun Hu2, Stone Cheng1.   

Abstract

This paper proposes a multipurpose reinforcement learning based low-level multirotor unmanned aerial vehicles control structure constructed using neural networks with model-free training. Other low-level reinforcement learning controllers developed in studies have only been applicable to a model-specific and physical-parameter-specific multirotor, and time-consuming training is required when switching to a different vehicle. We use a 6-degree-of-freedom dynamic model combining acceleration-based control from the policy neural network to overcome these problems. The UAV automatically learns the maneuver by an end-to-end neural network from fusion states to acceleration command. The state estimation is performed using the data from on-board sensors and motion capture. The motion capture system provides spatial position information and a multisensory fusion framework fuses the measurement from the onboard inertia measurement units for compensating the time delay and low update frequency of the capture system. Without requiring expert demonstration, the trained control policy implemented using an improved algorithm can be applied to various multirotors with the output directly mapped to actuators. The algorithm's ability to control multirotors in the hovering and the tracking task is evaluated. Through simulation and actual experiments, we demonstrate the flight control with a quadrotor and hexrotor by using the trained policy. With the same policy, we verify that we can stabilize the quadrotor and hexrotor in the air under random initial states.

Entities:  

Keywords:  quadrotor; reinforcement learning; unmanned aerial vehicle

Year:  2021        PMID: 34283119     DOI: 10.3390/s21134560

Source DB:  PubMed          Journal:  Sensors (Basel)        ISSN: 1424-8220            Impact factor:   3.576


  1 in total

1.  A Reinforcement Learning Based Dirt-Exploration for Cleaning-Auditing Robot.

Authors:  Thejus Pathmakumar; Mohan Rajesh Elara; Braulio Félix Gómez; Balakrishnan Ramalingam
Journal:  Sensors (Basel)       Date:  2021-12-13       Impact factor: 3.576

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.