Literature DB >> 25935054

Off-Policy Actor-Critic Structure for Optimal Control of Unknown Systems With Disturbances.

Ruizhuo Song, Frank L Lewis, Qinglai Wei, Huaguang Zhang.   

Abstract

An optimal control method is developed for unknown continuous-time systems with unknown disturbances in this paper. The integral reinforcement learning (IRL) algorithm is presented to obtain the iterative control. Off-policy learning is used to allow the dynamics to be completely unknown. Neural networks are used to construct critic and action networks. It is shown that if there are unknown disturbances, off-policy IRL may not converge or may be biased. For reducing the influence of unknown disturbances, a disturbances compensation controller is added. It is proven that the weight errors are uniformly ultimately bounded based on Lyapunov techniques. Convergence of the Hamiltonian function is also proven. The simulation study demonstrates the effectiveness of the proposed optimal control method for unknown systems with disturbances.

Year:  2015        PMID: 25935054     DOI: 10.1109/TCYB.2015.2421338

Source DB:  PubMed          Journal:  IEEE Trans Cybern        ISSN: 2168-2267            Impact factor:   11.448


  1 in total

Review 1.  Reinforcement learning in surgery.

Authors:  Shounak Datta; Yanjun Li; Matthew M Ruppert; Yuanfang Ren; Benjamin Shickel; Tezcan Ozrazgat-Baslanti; Parisa Rashidi; Azra Bihorac
Journal:  Surgery       Date:  2021-01-09       Impact factor: 4.348

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.