Literature DB >> 31944970

Model Learning and Knowledge Sharing for Cooperative Multiagent Systems in Stochastic Environment.

Wei-Cheng Jiang, Vignesh Narayanan, Jr-Shin Li.   

Abstract

An imposing task for a reinforcement learning agent in an uncertain environment is to expeditiously learn a policy or a sequence of actions, with which it can achieve the desired goal. In this article, we present an incremental model learning scheme to reconstruct the model of a stochastic environment. In the proposed learning scheme, we introduce a clustering algorithm to assimilate the model information and estimate the probability for each state transition. In addition, utilizing the reconstructed model, we present an experience replay strategy to create virtual interactive experiences by incorporating a balance between exploration and exploitation, which greatly accelerates learning and enables planning. Furthermore, we extend the proposed learning scheme for a multiagent framework to decrease the effort required for exploration and to reduce the learning time in a large environment. In this multiagent framework, we introduce a knowledge-sharing algorithm to share the reconstructed model information among the different agents, as needed, and develop a computationally efficient knowledge fusing mechanism to fuse the knowledge acquired using the agents' own experience with the knowledge received from its teammates. Finally, the simulation results with comparative analysis are provided to demonstrate the efficacy of the proposed methods in the complex learning tasks.

Entities:  

Year:  2021        PMID: 31944970      PMCID: PMC7338261          DOI: 10.1109/TCYB.2019.2958912

Source DB:  PubMed          Journal:  IEEE Trans Cybern        ISSN: 2168-2267            Impact factor:   11.448


  10 in total

1.  A study on expertise of agents and its effects on cooperative Q-learning.

Authors:  Babak Nadjar Araabi; Sahar Mastoureshgh; Majid Nili Ahmadabadi
Journal:  IEEE Trans Syst Man Cybern B Cybern       Date:  2007-04

2.  Expertness based cooperative Q-learning.

Authors:  M N Ahmadabadi; M Asadpour
Journal:  IEEE Trans Syst Man Cybern B Cybern       Date:  2002

3.  Parallel Online Temporal Difference Learning for Motor Control.

Authors:  Wouter Caarls; Erik Schuitema
Journal:  IEEE Trans Neural Netw Learn Syst       Date:  2015-06-23       Impact factor: 10.451

4.  Model learning and knowledge sharing for a multiagent system with Dyna-Q learning.

Authors:  Kao-Shing Hwang; Wei-Cheng Jiang; Yu-Jen Chen
Journal:  IEEE Trans Cybern       Date:  2014-08-05       Impact factor: 11.448

5.  A clustering-based graph Laplacian framework for value function approximation in reinforcement learning.

Authors:  Xin Xu; Zhenhua Huang; Daniel Graves; Witold Pedrycz
Journal:  IEEE Trans Cybern       Date:  2014-04-25       Impact factor: 11.448

6.  Discrete-Time Deterministic $Q$ -Learning: A Novel Convergence Analysis.

Authors:  Qinglai Wei; Frank L Lewis; Qiuye Sun; Pengfei Yan; Ruizhuo Song
Journal:  IEEE Trans Cybern       Date:  2016-04-11       Impact factor: 11.448

7.  FMRQ-A Multiagent Reinforcement Learning Algorithm for Fully Cooperative Tasks.

Authors:  Zhen Zhang; Dongbin Zhao; Junwei Gao; Dongqing Wang; Yujie Dai
Journal:  IEEE Trans Cybern       Date:  2016-04-14       Impact factor: 11.448

8.  Extending the Peak Bandwidth of Parameters for Softmax Selection in Reinforcement Learning.

Authors:  Kazunori Iwata
Journal:  IEEE Trans Neural Netw Learn Syst       Date:  2016-05-11       Impact factor: 10.451

9.  Model-Based Reinforcement Learning for Infinite-Horizon Approximate Optimal Tracking.

Authors:  Rushikesh Kamalapurkar; Lindsey Andrews; Patrick Walters; Warren E Dixon
Journal:  IEEE Trans Neural Netw Learn Syst       Date:  2016-02-03       Impact factor: 10.451

10.  MOO-MDP: An Object-Oriented Representation for Cooperative Multiagent Reinforcement Learning.

Authors:  Felipe Leno Da Silva; Ruben Glatt; Anna Helena Reali Costa
Journal:  IEEE Trans Cybern       Date:  2017-12-28       Impact factor: 11.448

  10 in total
  1 in total

1.  Knowledge Reuse of Multi-Agent Reinforcement Learning in Cooperative Tasks.

Authors:  Daming Shi; Junbo Tong; Yi Liu; Wenhui Fan
Journal:  Entropy (Basel)       Date:  2022-03-28       Impact factor: 2.524

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.