Literature DB >> 29771662

Optimal and Autonomous Control Using Reinforcement Learning: A Survey.

Bahare Kiumarsi, Kyriakos G Vamvoudakis, Hamidreza Modares, Frank L Lewis.   

Abstract

This paper reviews the current state of the art on reinforcement learning (RL)-based feedback control solutions to optimal regulation and tracking of single and multiagent systems. Existing RL solutions to both optimal and control problems, as well as graphical games, will be reviewed. RL methods learn the solution to optimal control and game problems online and using measured data along the system trajectories. We discuss Q-learning and the integral RL algorithm as core algorithms for discrete-time (DT) and continuous-time (CT) systems, respectively. Moreover, we discuss a new direction of off-policy RL for both CT and DT systems. Finally, we review several applications.

Entities:  

Year:  2018        PMID: 29771662     DOI: 10.1109/TNNLS.2017.2773458

Source DB:  PubMed          Journal:  IEEE Trans Neural Netw Learn Syst        ISSN: 2162-237X            Impact factor:   10.451


  6 in total

1.  Counter a Drone in a Complex Neighborhood Area by Deep Reinforcement Learning.

Authors:  Ender Çetin; Cristina Barrado; Enric Pastor
Journal:  Sensors (Basel)       Date:  2020-04-18       Impact factor: 3.576

2.  Reinforcement Emotion-Cognition System: A Teaching Words Task.

Authors:  Minjia Li; Lun Xie; Anqi Zhang; Fuji Ren
Journal:  Comput Intell Neurosci       Date:  2019-05-02

3.  Cognitive Factories: Modeling Situated Entropy in Physical Work Carried Out by Humans and Robots.

Authors:  Stephen Fox; Adrian Kotelba; Ilkka Niskanen
Journal:  Entropy (Basel)       Date:  2018-09-01       Impact factor: 2.524

Review 4.  Reinforcement Learning for Clinical Decision Support in Critical Care: Comprehensive Review.

Authors:  Siqi Liu; Kay Choong See; Kee Yuan Ngiam; Leo Anthony Celi; Xingzhi Sun; Mengling Feng
Journal:  J Med Internet Res       Date:  2020-07-20       Impact factor: 5.428

5.  A Reinforcement Learning Control in Hot Stamping for Cycle Time Optimization.

Authors:  Nuria Nievas; Adela Pagès-Bernaus; Francesc Bonada; Lluís Echeverria; Albert Abio; Danillo Lange; Jaume Pujante
Journal:  Materials (Basel)       Date:  2022-07-11       Impact factor: 3.748

6.  Adaptive Sliding Mode Disturbance Observer and Deep Reinforcement Learning Based Motion Control for Micropositioners.

Authors:  Shiyun Liang; Ruidong Xi; Xiao Xiao; Zhixin Yang
Journal:  Micromachines (Basel)       Date:  2022-03-17       Impact factor: 2.891

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.