Literature DB >> 33504073

Learning-Based End-to-End Path Planning for Lunar Rovers with Safety Constraints.

Xiaoqiang Yu1, Ping Wang2, Zexu Zhang1.   

Abstract

Path planning is an essential technology for lunar rover to achieve safe and efficient autonomous exploration mission, this paper proposes a learning-based end-to-end path planning algorithm for lunar rovers with safety constraints. Firstly, a training environment integrating real lunar surface terrain data was built using the Gazebo simulation environment and a lunar rover simulator was created in it to simulate the real lunar surface environment and the lunar rover system. Then an end-to-end path planning algorithm based on deep reinforcement learning method is designed, including state space, action space, network structure, reward function considering slip behavior, and training method based on proximal policy optimization. In addition, to improve the generalization ability to different lunar surface topography and different scale environments, a variety of training scenarios were set up to train the network model using the idea of curriculum learning. The simulation results show that the proposed planning algorithm can successfully achieve the end-to-end path planning of the lunar rover, and the path generated by the proposed algorithm has a higher safety guarantee compared with the classical path planning algorithm.

Entities:  

Keywords:  deep reinforcement learning; learning-based; lunar rovers; path planning

Year:  2021        PMID: 33504073      PMCID: PMC7866010          DOI: 10.3390/s21030796

Source DB:  PubMed          Journal:  Sensors (Basel)        ISSN: 1424-8220            Impact factor:   3.576


  6 in total

1.  China's present and future lunar exploration program.

Authors:  Chunlai Li; Chi Wang; Yong Wei; Yangting Lin
Journal:  Science       Date:  2019-07-19       Impact factor: 47.728

2.  Human-level control through deep reinforcement learning.

Authors:  Volodymyr Mnih; Koray Kavukcuoglu; David Silver; Andrei A Rusu; Joel Veness; Marc G Bellemare; Alex Graves; Martin Riedmiller; Andreas K Fidjeland; Georg Ostrovski; Stig Petersen; Charles Beattie; Amir Sadik; Ioannis Antonoglou; Helen King; Dharshan Kumaran; Daan Wierstra; Shane Legg; Demis Hassabis
Journal:  Nature       Date:  2015-02-26       Impact factor: 49.962

3.  Vision-Based Robot Navigation through Combining Unsupervised Learning and Hierarchical Reinforcement Learning.

Authors:  Xiaomao Zhou; Tao Bai; Yanbin Gao; Yuntao Han
Journal:  Sensors (Basel)       Date:  2019-04-01       Impact factor: 3.576

4.  Deep Multi-Layer Perception Based Terrain Classification for Planetary Exploration Rovers.

Authors:  Chengchao Bai; Jifeng Guo; Linli Guo; Junlin Song
Journal:  Sensors (Basel)       Date:  2019-07-13       Impact factor: 3.576

5.  A Two-Stage Method for Target Searching in the Path Planning for Mobile Robots.

Authors:  Tao Song; Xiang Huo; Xinkai Wu
Journal:  Sensors (Basel)       Date:  2020-12-03       Impact factor: 3.576

  6 in total
  1 in total

Review 1.  Path Planning for Autonomous Mobile Robots: A Review.

Authors:  José Ricardo Sánchez-Ibáñez; Carlos J Pérez-Del-Pulgar; Alfonso García-Cerezo
Journal:  Sensors (Basel)       Date:  2021-11-26       Impact factor: 3.576

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.