Heuristically-accelerated multiagent reinforcement learning.

Reinaldo A C Bianchi, Murilo F Martins, Carlos H C Ribeiro, Anna Helena Reali Costa.

Abstract

This paper presents a novel class of algorithms, called Heuristically-Accelerated Multiagent Reinforcement Learning (HAMRL), which allows the use of heuristics to speed up well-known multiagent reinforcement learning (RL) algorithms such as Minimax-Q. HAMRL algorithms are characterized by a heuristic function, which suggests the selection of particular actions over others. This function represents an initial action selection policy, which can be handcrafted, extracted from previous experience in distinct domains, or learnt from observation. To validate the proposal, a thorough theoretical analysis proving the convergence of four algorithms from the HAMRL class (HAMMQ, HAMQ(λ), HAMQS, and HAMS) is presented. In addition, a comprehensive, systematic evaluation was conducted in two distinct adversarial domains. The results show that even the most straightforward heuristics can produce virtually optimal action selection policies in far fewer episodes, significantly improving the performance of HAMRL algorithms over vanilla RL algorithms.
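The abstract does not give the action selection rule, so the following is only a minimal sketch of how a heuristic function might bias action choice in a two-player zero-sum setting. It assumes the heuristic is added to the Q-value with a weight `xi` and that the agent picks the action with the best worst-case value over opponent actions. The names `select_action`, `xi`, `epsilon`, and the dictionary layout of `q` and `h` are hypothetical and not taken from the paper. The sketch also uses a deterministic maximin choice, which is a simplification: Minimax-Q proper computes a mixed policy by linear programming.

```python
import random


def select_action(q, h, state, actions, opp_actions,
                  xi=1.0, epsilon=0.1, rng=random):
    """Epsilon-greedy maximin choice over Q + xi * H (illustrative only).

    q, h: dicts keyed by (state, action, opponent_action); missing
    entries default to 0.0. xi weights the heuristic (assumed form).
    """
    # Explore with probability epsilon.
    if rng.random() < epsilon:
        return rng.choice(actions)

    def worst_case(a):
        # Value of action a against the least favourable opponent reply.
        return min(
            q.get((state, a, o), 0.0) + xi * h.get((state, a, o), 0.0)
            for o in opp_actions
        )

    # Exploit: pick the action with the best worst-case biased value.
    return max(actions, key=worst_case)


# Usage: with an untrained (empty) Q-table, the heuristic alone
# decides the greedy action.
actions = ["left", "right"]
opp_actions = ["block", "wait"]
h = {("s0", "right", "block"): 1.0, ("s0", "right", "wait"): 1.0}
choice = select_action({}, h, "s0", actions, opp_actions, epsilon=0.0)
```

With `xi=0.0` the rule reduces to a plain greedy maximin choice over the learned Q-values, so in this sketch the heuristic affects only which actions are tried, not the values that are stored.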

Year: 2014 PMID: 23757547 DOI: 10.1109/TCYB.2013.2253094

Source DB: PubMed Journal: IEEE Trans Cybern ISSN: 2168-2267 Impact factor: 11.448

Cited

1 in total

1. Route searching based on neural networks and heuristic reinforcement learning.

Authors: Fengyun Zhang; Shukai Duan; Lidan Wang
Journal: Cogn Neurodyn Date: 2017-02-09 Impact factor: 5.082