Literature DB >> 33501200

Reinforcement Learning-Based Tracking Control of USVs in Varying Operational Conditions.

Andreas B Martinsen1, Anastasios M Lekkas1,2, Sébastien Gros1, Jon Arne Glomsrud3, Tom Arne Pedersen3.   

Abstract

We present a reinforcement learning-based (RL) control scheme for trajectory tracking of fully-actuated surface vessels. The proposed method learns online both a model-based feedforward controller, as well an optimizing feedback policy in order to follow a desired trajectory under the influence of environmental forces. The method's efficiency is evaluated via simulations and sea trials, with the unmanned surface vehicle (USV) ReVolt performing three different tracking tasks: The four corner DP test, straight-path tracking and curved-path tracking. The results demonstrate the method's ability to accomplish the control objectives and a good agreement between the performance achieved in the Revolt Digital Twin and the sea trials. Finally, we include an section with considerations about assurance for RL-based methods and where our approach stands in terms of the main challenges.
Copyright © 2020 Martinsen, Lekkas, Gros, Glomsrud and Pedersen.

Entities:  

Keywords:  approximate dynamic programming (ADP); autonomous ships; dynamic positioning (DP); model-based adaptive control; optimal control; reinforcement learning; system identification; trajectory tracking

Year:  2020        PMID: 33501200      PMCID: PMC7806118          DOI: 10.3389/frobt.2020.00032

Source DB:  PubMed          Journal:  Front Robot AI        ISSN: 2296-9144


  2 in total

1.  Reinforcement learning in continuous time and space.

Authors:  K Doya
Journal:  Neural Comput       Date:  2000-01       Impact factor: 2.026

2.  Mastering the game of Go with deep neural networks and tree search.

Authors:  David Silver; Aja Huang; Chris J Maddison; Arthur Guez; Laurent Sifre; George van den Driessche; Julian Schrittwieser; Ioannis Antonoglou; Veda Panneershelvam; Marc Lanctot; Sander Dieleman; Dominik Grewe; John Nham; Nal Kalchbrenner; Ilya Sutskever; Timothy Lillicrap; Madeleine Leach; Koray Kavukcuoglu; Thore Graepel; Demis Hassabis
Journal:  Nature       Date:  2016-01-28       Impact factor: 49.962

  2 in total
  1 in total

1.  Estimating spatio-temporal fields through reinforcement learning.

Authors:  Paulo Padrao; Jose Fuentes; Leonardo Bobadilla; Ryan N Smith
Journal:  Front Robot AI       Date:  2022-09-05
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.