Literature DB >> 18255702

Adaptive critic designs.

D V Prokhorov1, D C Wunsch.   

Abstract

We discuss a variety of adaptive critic designs (ACDs) for neurocontrol. These are suitable for learning in noisy, nonlinear, and nonstationary environments. They have common roots as generalizations of dynamic programming for neural reinforcement learning approaches. Our discussion of these origins leads to an explanation of three design families: heuristic dynamic programming, dual heuristic programming, and globalized dual heuristic programming (GDHP). The main emphasis is on DHP and GDHP as advanced ACDs. We suggest two new modifications of the original GDHP design that are currently the only working implementations of GDHP. They promise to be useful for many engineering applications in the areas of optimization and optimal control. Based on one of these modifications, we present a unified approach to all ACDs. This leads to a generalized training procedure for ACDs.

Entities:  

Year:  1997        PMID: 18255702     DOI: 10.1109/72.623201

Source DB:  PubMed          Journal:  IEEE Trans Neural Netw        ISSN: 1045-9227


  2 in total

Review 1.  Future of seizure prediction and intervention: closing the loop.

Authors:  Vivek Nagaraj; Steven T Lee; Esther Krook-Magnuson; Ivan Soltesz; Pascal Benquet; Pedro P Irazoqui; Theoden I Netoff
Journal:  J Clin Neurophysiol       Date:  2015-06       Impact factor: 2.177

2.  Artificial Development by Reinforcement Learning Can Benefit From Multiple Motivations.

Authors:  Günther Palm; Friedhelm Schwenker
Journal:  Front Robot AI       Date:  2019-02-14
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.