Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 FMRQ-A Multiagent Reinforcement Learning Algorithm for Fully Cooperative Tasks.

Literature DB >> 27101627

FMRQ-A Multiagent Reinforcement Learning Algorithm for Fully Cooperative Tasks.

Zhen Zhang, Dongbin Zhao, Junwei Gao, Dongqing Wang, Yujie Dai.

Abstract

In this paper, we propose a multiagent reinforcement learning algorithm dealing with fully cooperative tasks. The algorithm is called frequency of the maximum reward Q-learning (FMRQ). FMRQ aims to achieve one of the optimal Nash equilibria so as to optimize the performance index in multiagent systems. The frequency of obtaining the highest global immediate reward instead of immediate reward is used as the reinforcement signal. With FMRQ each agent does not need the observation of the other agents' actions and only shares its state and reward at each step. We validate FMRQ through case studies of repeated games: four cases of two-player two-action and one case of three-player two-action. It is demonstrated that FMRQ can converge to one of the optimal Nash equilibria in these cases. Moreover, comparison experiments on tasks with multiple states and finite steps are conducted. One is box-pushing and the other one is distributed sensor network problem. Experimental results show that the proposed algorithm outperforms others with higher performance.

Year: 2016 PMID： 27101627 DOI： 10.1109/TCYB.2016.2544866

Source DB: PubMed Journal: IEEE Trans Cybern ISSN： 2168-2267 Impact factor: 11.448

Keyword Cloud
Cited

2 in total

1. Multi-AGV path planning with double-path constraints by using an improved genetic algorithm.

Authors: Zengliang Han; Dongqing Wang; Feng Liu; Zhiyong Zhao
Journal: PLoS One Date: 2017-07-26 Impact factor: 3.240

2. Model Learning and Knowledge Sharing for Cooperative Multiagent Systems in Stochastic Environment.

Authors: Wei-Cheng Jiang; Vignesh Narayanan; Jr-Shin Li
Journal: IEEE Trans Cybern Date: 2021-12-22 Impact factor: 11.448

2 in total