Literature DB >> 24802018

A clustering-based graph Laplacian framework for value function approximation in reinforcement learning.

Xin Xu, Zhenhua Huang, Daniel Graves, Witold Pedrycz.   

Abstract

In order to deal with the sequential decision problems with large or continuous state spaces, feature representation and function approximation have been a major research topic in reinforcement learning (RL). In this paper, a clustering-based graph Laplacian framework is presented for feature representation and value function approximation (VFA) in RL. By making use of clustering-based techniques, that is, K-means clustering or fuzzy C-means clustering, a graph Laplacian is constructed by subsampling in Markov decision processes (MDPs) with continuous state spaces. The basis functions for VFA can be automatically generated from spectral analysis of the graph Laplacian. The clustering-based graph Laplacian is integrated with a class of approximation policy iteration algorithms called representation policy iteration (RPI) for RL in MDPs with continuous state spaces. Simulation and experimental results show that, compared with previous RPI methods, the proposed approach needs fewer sample points to compute an efficient set of basis functions and the learning control performance can be improved for a variety of parameter settings.

Mesh:

Year:  2014        PMID: 24802018     DOI: 10.1109/TCYB.2014.2311578

Source DB:  PubMed          Journal:  IEEE Trans Cybern        ISSN: 2168-2267            Impact factor:   11.448


  3 in total

1.  Cluster Prototypes and Fuzzy Memberships Jointly Leveraged Cross-Domain Maximum Entropy Clustering.

Authors:  Pengjiang Qian; Yizhang Jiang; Zhaohong Deng; Lingzhi Hu; Shouwei Sun; Shitong Wang; Raymond F Muzic
Journal:  IEEE Trans Cybern       Date:  2016-01       Impact factor: 11.448

2.  Model Learning and Knowledge Sharing for Cooperative Multiagent Systems in Stochastic Environment.

Authors:  Wei-Cheng Jiang; Vignesh Narayanan; Jr-Shin Li
Journal:  IEEE Trans Cybern       Date:  2021-12-22       Impact factor: 11.448

3.  A biomarker basing on radiomics for the prediction of overall survival in non-small cell lung cancer patients.

Authors:  Bo He; Wei Zhao; Jiang-Yuan Pi; Dan Han; Yuan-Ming Jiang; Zhen-Guang Zhang; Wei Zhao
Journal:  Respir Res       Date:  2018-10-10
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.