Literature DB >> 32203045

Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications.

Thanh Thi Nguyen, Ngoc Duy Nguyen, Saeid Nahavandi.   

Abstract

Reinforcement learning (RL) algorithms have been around for decades and employed to solve various sequential decision-making problems. These algorithms, however, have faced great challenges when dealing with high-dimensional environments. The recent development of deep learning has enabled RL methods to drive optimal policies for sophisticated and capable agents, which can perform efficiently in these challenging environments. This article addresses an important aspect of deep RL related to situations that require multiple agents to communicate and cooperate to solve complex tasks. A survey of different approaches to problems related to multiagent deep RL (MADRL) is presented, including nonstationarity, partial observability, continuous state and action spaces, multiagent training schemes, and multiagent transfer learning. The merits and demerits of the reviewed methods will be analyzed and discussed with their corresponding applications explored. It is envisaged that this review provides insights about various MADRL methods and can lead to the future development of more robust and highly useful multiagent learning methods for solving real-world problems.

Year:  2020        PMID: 32203045     DOI: 10.1109/TCYB.2020.2977374

Source DB:  PubMed          Journal:  IEEE Trans Cybern        ISSN: 2168-2267            Impact factor:   11.448


  11 in total

1.  Force-guided autonomous robotic ultrasound scanning control method for soft uncertain environment.

Authors:  Guochen Ning; Jiaqi Chen; Xinran Zhang; Hongen Liao
Journal:  Int J Comput Assist Radiol Surg       Date:  2021-08-09       Impact factor: 2.924

2.  Explaining deep reinforcement learning decisions in complex multiagent settings: towards enabling automation in air traffic flow management.

Authors:  Theocharis Kravaris; Konstantinos Lentzos; Georgios Santipantakis; George A Vouros; Gennady Andrienko; Natalia Andrienko; Ian Crook; Jose Manuel Cordero Garcia; Enrique Iglesias Martinez
Journal:  Appl Intell (Dordr)       Date:  2022-06-06       Impact factor: 5.019

3.  Multi-Agent Dynamic Resource Allocation in 6G in-X Subnetworks with Limited Sensing Information.

Authors:  Ramoni Adeogun; Gilberto Berardinelli
Journal:  Sensors (Basel)       Date:  2022-07-05       Impact factor: 3.847

Review 4.  Deep Reinforcement Learning for Resource Management on Network Slicing: A Survey.

Authors:  Johanna Andrea Hurtado Sánchez; Katherine Casilimas; Oscar Mauricio Caicedo Rendon
Journal:  Sensors (Basel)       Date:  2022-04-15       Impact factor: 3.847

5.  Distributed Learning for Dynamic Channel Access in Underwater Sensor Networks.

Authors:  Huicheol Shin; Yongjae Kim; Seungjae Baek; Yujae Song
Journal:  Entropy (Basel)       Date:  2020-09-07       Impact factor: 2.524

6.  Searching and Tracking an Unknown Number of Targets: A Learning-Based Method Enhanced with Maps Merging.

Authors:  Peng Yan; Tao Jia; Chengchao Bai
Journal:  Sensors (Basel)       Date:  2021-02-04       Impact factor: 3.576

7.  A Novel Training and Collaboration Integrated Framework for Human-Agent Teleoperation.

Authors:  Zebin Huang; Ziwei Wang; Weibang Bai; Yanpei Huang; Lichao Sun; Bo Xiao; Eric M Yeatman
Journal:  Sensors (Basel)       Date:  2021-12-14       Impact factor: 3.576

8.  Overcoming Challenges of Applying Reinforcement Learning for Intelligent Vehicle Control.

Authors:  Rafael Pina; Haileleol Tibebu; Joosep Hook; Varuna De Silva; Ahmet Kondoz
Journal:  Sensors (Basel)       Date:  2021-11-25       Impact factor: 3.576

9.  Performance Improvement of Single-Frequency CW Laser Using a Temperature Controller Based on Machine Learning.

Authors:  Haoming Qiao; Weina Peng; Pixian Jin; Jing Su; Huadong Lu
Journal:  Micromachines (Basel)       Date:  2022-06-30       Impact factor: 3.523

10.  RIS-Assisted Multi-Antenna AmBC Signal Detection Using Deep Reinforcement Learning.

Authors:  Feng Jing; Hailin Zhang; Mei Gao; Bin Xue; Kunrui Cao
Journal:  Sensors (Basel)       Date:  2022-08-16       Impact factor: 3.847

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.