Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Analysing factorizations of action-value networks for cooperative multi-agent reinforcement learning.

Literature DB >> 34720685

Analysing factorizations of action-value networks for cooperative multi-agent reinforcement learning.

Jacopo Castellini¹, Frans A Oliehoek², Rahul Savani¹, Shimon Whiteson³.

Abstract

Recent years have seen the application of deep reinforcement learning techniques to cooperative multi-agent systems, with great empirical success. However, given the lack of theoretical insight, it remains unclear what the employed neural networks are learning, or how we should enhance their learning power to address the problems on which they fail. In this work, we empirically investigate the learning power of various network architectures on a series of one-shot games. Despite their simplicity, these games capture many of the crucial problems that arise in the multi-agent setting, such as an exponential number of joint actions or the lack of an explicit coordination mechanism. Our results extend those in Castellini et al. (Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, AAMAS'19.International Foundation for Autonomous Agents and Multiagent Systems, pp 1862-1864, 2019) and quantify how well various approaches can represent the requisite value functions, and help us identify the reasons that can impede good performance, like sparsity of the values or too tight coordination requirements.

Entities: Chemical

Keywords: Action-value representation; Decision-making; Multi-agent systems; Neural networks; One-shot games

Year: 2021 PMID： 34720685 PMCID： PMC8550438 DOI： 10.1007/s10458-021-09506-w

Source DB: PubMed Journal: Auton Agent Multi Agent Syst ISSN： 1387-2532 Impact factor: 1.431

Keyword Cloud
References

4 in total

1. Long short-term memory.

Authors: S Hochreiter; J Schmidhuber
Journal: Neural Comput Date: 1997-11-15 Impact factor: 2.026

2. Human-level control through deep reinforcement learning.

Authors: Volodymyr Mnih; Koray Kavukcuoglu; David Silver; Andrei A Rusu; Joel Veness; Marc G Bellemare; Alex Graves; Martin Riedmiller; Andreas K Fidjeland; Georg Ostrovski; Stig Petersen; Charles Beattie; Amir Sadik; Ioannis Antonoglou; Helen King; Dharshan Kumaran; Daan Wierstra; Shane Legg; Demis Hassabis
Journal: Nature Date: 2015-02-26 Impact factor: 49.962

3. A multi-agent framework for packet routing in wireless sensor networks.

Authors: Dayong Ye; Minjie Zhang; Yun Yang
Journal: Sensors (Basel) Date: 2015-04-28 Impact factor: 3.576

4. Multiagent cooperation and competition with deep reinforcement learning.

Authors: Ardi Tampuu; Tambet Matiisen; Dorian Kodelja; Ilya Kuzovkin; Kristjan Korjus; Juhan Aru; Jaan Aru; Raul Vicente
Journal: PLoS One Date: 2017-04-05 Impact factor: 3.240

4 in total