Reinforcement Learning-Based Tracking Control of USVs in Varying Operational Conditions.

Andreas B Martinsen¹, Anastasios M Lekkas¹,², Sébastien Gros¹, Jon Arne Glomsrud³, Tom Arne Pedersen³.

Abstract

We present a reinforcement learning (RL)-based control scheme for trajectory tracking of fully actuated surface vessels. The proposed method learns online both a model-based feedforward controller and an optimizing feedback policy in order to follow a desired trajectory under the influence of environmental forces. The method's efficiency is evaluated via simulations and sea trials, with the unmanned surface vehicle (USV) ReVolt performing three different tracking tasks: the four-corner dynamic positioning (DP) test, straight-path tracking, and curved-path tracking. The results demonstrate the method's ability to accomplish the control objectives and show good agreement between the performance achieved in the ReVolt Digital Twin and the sea trials. Finally, we include a section with considerations about assurance for RL-based methods and where our approach stands in terms of the main challenges.

Keywords: approximate dynamic programming (ADP); autonomous ships; dynamic positioning (DP); model-based adaptive control; optimal control; reinforcement learning; system identification; trajectory tracking

Year: 2020 PMID: 33501200 PMCID: PMC7806118 DOI: 10.3389/frobt.2020.00032

Source DB: PubMed Journal: Front Robot AI ISSN: 2296-9144

2 in total

1. Reinforcement learning in continuous time and space.

Authors: K Doya
Journal: Neural Comput Date: 2000-01 Impact factor: 2.026

2. Mastering the game of Go with deep neural networks and tree search.

Authors: David Silver; Aja Huang; Chris J Maddison; Arthur Guez; Laurent Sifre; George van den Driessche; Julian Schrittwieser; Ioannis Antonoglou; Veda Panneershelvam; Marc Lanctot; Sander Dieleman; Dominik Grewe; John Nham; Nal Kalchbrenner; Ilya Sutskever; Timothy Lillicrap; Madeleine Leach; Koray Kavukcuoglu; Thore Graepel; Demis Hassabis
Journal: Nature Date: 2016-01-28 Impact factor: 49.962

1 in total

1. Estimating spatio-temporal fields through reinforcement learning.

Authors: Paulo Padrao; Jose Fuentes; Leonardo Bobadilla; Ryan N Smith
Journal: Front Robot AI Date: 2022-09-05
