Feudal Multi-Agent Deep Reinforcement Learning for Traffic Signal Control

,

Reinforcement learning (RL) is a promising technique for optimizing traffic signal controllers that dynamically respond to realtime traffic conditions. Recent efforts that applied Multi-Agent RL (MARL) to this problem have shown remarkable improvement over centralized RL, with the scalability to solve large problems by distributing the global control to local RL agents. Unfortunately, it is also easy to get stuck in local optima because each agent only has partial observability of the environment with limited communication. To tackle this, we borrow ideas from feudal RL and propose a novel MARL approach combining with the feudal hierarchy. Specifically, we split the traffic network into several regions, where each region is controlled by a manager agent and the agents who control the traffic signals are its workers. In our method, managers coordinate their high-level behaviors and set goals for their workers in the region, while each lower-level worker controls traffic signals to fulfill the managerial goals. By doing so, we are able to coordinate globally while retain scalability. We empirically evaluate our method both in a synthetic traffic grid and real-world traffic network using the SUMO simulator. Our experimental results show that our approach outperforms the state-of-the-art in almost all evaluation metrics commonly used for traffic signal control.

» Read on
Jinming Ma, Feng Wu. Feudal Multi-Agent Deep Reinforcement Learning for Traffic Signal Control. In Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), pages 816-824, Auckland, New Zealand, May 2020.
Save as file
@inproceedings{MWaamas20,
 address = {Auckland, New Zealand},
 author = {Jinming Ma and Feng Wu},
 booktitle = {Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems (AAMAS)},
 month = {May},
 pages = {816-824},
 title = {Feudal Multi-Agent Deep Reinforcement Learning for Traffic Signal Control},
 year = {2020}
}