Literature DB >> 30946688

Cooperative Deep Reinforcement Learning for Large-Scale Traffic Grid Signal Control.

Tian Tan, Feng Bao, Yue Deng, Alex Jin, Qionghai Dai, Jie Wang.   

Abstract

Exploiting reinforcement learning (RL) for traffic congestion reduction is a frontier topic in intelligent transportation research. The difficulty in this problem stems from the inability of the RL agent simultaneously monitoring multiple signal lights when taking into account complicated traffic dynamics in different regions of a traffic system. Such challenge is even more outstanding when forming control decisions on a large-scale traffic grid, where the RL action space grows exponentially with the number of intersections within the traffic grid. In this paper, we tackle such a problem by proposing a cooperative deep reinforcement learning (Coder) framework. The intuition behind Coder is to decompose the original difficult RL task as a number of subproblems with relatively easy RL goals. Accordingly, we implement Coder with multiple regional agents and a centralized global agent. Each regional agent learns its own RL policy and value functions over a small region with limited actions. Then, the centralized global agent hierarchically aggregates RL achievements from different regional agents and forms the final Q -function over the entire large-scale traffic grid. The experimental investigations demonstrate that the proposed Coder could reduce on average 30% congestions in terms of the number of waiting vehicles during high density traffic flows in simulations.

Entities:  

Year:  2019        PMID: 30946688     DOI: 10.1109/TCYB.2019.2904742

Source DB:  PubMed          Journal:  IEEE Trans Cybern        ISSN: 2168-2267            Impact factor:   11.448


  2 in total

1.  Explaining deep reinforcement learning decisions in complex multiagent settings: towards enabling automation in air traffic flow management.

Authors:  Theocharis Kravaris; Konstantinos Lentzos; Georgios Santipantakis; George A Vouros; Gennady Andrienko; Natalia Andrienko; Ian Crook; Jose Manuel Cordero Garcia; Enrique Iglesias Martinez
Journal:  Appl Intell (Dordr)       Date:  2022-06-06       Impact factor: 5.019

2.  Intelligent L2-L∞ Consensus of Multiagent Systems under Switching Topologies via Fuzzy Deep Q Learning.

Authors:  Haoyu Cheng; Linpeng Xu; Ruijia Song; Yue Zhu; Yangwang Fang
Journal:  Comput Intell Neurosci       Date:  2022-02-16
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.