Literature DB >> 24807455

Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems.

Derong Liu, Qinglai Wei.   

Abstract

This paper is concerned with a new discrete-time policy iteration adaptive dynamic programming (ADP) method for solving the infinite horizon optimal control problem of nonlinear systems. The idea is to use an iterative ADP technique to obtain the iterative control law, which optimizes the iterative performance index function. The main contribution of this paper is to analyze the convergence and stability properties of policy iteration method for discrete-time nonlinear systems for the first time. It shows that the iterative performance index function is nonincreasingly convergent to the optimal solution of the Hamilton-Jacobi-Bellman equation. It is also proven that any of the iterative control laws can stabilize the nonlinear systems. Neural networks are used to approximate the performance index function and compute the optimal control law, respectively, for facilitating the implementation of the iterative ADP algorithm, where the convergence of the weight matrices is analyzed. Finally, the numerical results and analysis are presented to illustrate the performance of the developed method.

Entities:  

Year:  2014        PMID: 24807455     DOI: 10.1109/TNNLS.2013.2281663

Source DB:  PubMed          Journal:  IEEE Trans Neural Netw Learn Syst        ISSN: 2162-237X            Impact factor:   10.451


  2 in total

1.  Data-Driven Optimal Assistance Control of a Lower Limb Exoskeleton for Hemiplegic Patients.

Authors:  Zhinan Peng; Rui Luo; Rui Huang; Tengbo Yu; Jiangping Hu; Kecheng Shi; Hong Cheng
Journal:  Front Neurorobot       Date:  2020-07-03       Impact factor: 2.650

2.  Safe Model-Based Reinforcement Learning for Systems With Parametric Uncertainties.

Authors:  S M Nahid Mahmud; Scott A Nivison; Zachary I Bell; Rushikesh Kamalapurkar
Journal:  Front Robot AI       Date:  2021-12-16
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.