TY - GEN
T1 - PairUpLight
T2 - 45th IEEE International Conference on Distributed Computing Systems, ICDCS 2025
AU - Du, Wenlu
AU - Li, Jing
AU - Wang, Guiling Grace
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - The management of heavy traffic demands has been significantly improved by employing synchronized traffic signal control at multiple intersections. Multi-agent Reinforcement Learning (MARL) techniques have been widely utilized to achieve this coordination. However, these approaches predominantly depend on manually crafted features from adjacent intersections, which impedes their generalization to new scenarios. Furthermore, while displaying high accuracy for specific traffic flow patterns, these methods often lack the necessary robustness for other patterns. In this study, our objective is to develop an effective signal timing plan by directly learning the minimal required communication between intersections from traffic data. We introduce a novel, comprehensive approach that combines multi-agent reinforcement learning with a learned communication mechanism. Our model incorporates a coordinated actor network and a centralized critic network to address the challenges of non-stationarity. We conducted extensive experiments comparing our model with other commonly used non-RL and benchmark MARL techniques. The evaluation results show that our proposed model, which relies only on local sensory input and a single message from neighboring intersections, excels in managing various traffic flow patterns. Furthermore, our model outperforms competing approaches in terms of robustness, resilience, and overall performance.
AB - The management of heavy traffic demands has been significantly improved by employing synchronized traffic signal control at multiple intersections. Multi-agent Reinforcement Learning (MARL) techniques have been widely utilized to achieve this coordination. However, these approaches predominantly depend on manually crafted features from adjacent intersections, which impedes their generalization to new scenarios. Furthermore, while displaying high accuracy for specific traffic flow patterns, these methods often lack the necessary robustness for other patterns. In this study, our objective is to develop an effective signal timing plan by directly learning the minimal required communication between intersections from traffic data. We introduce a novel, comprehensive approach that combines multi-agent reinforcement learning with a learned communication mechanism. Our model incorporates a coordinated actor network and a centralized critic network to address the challenges of non-stationarity. We conducted extensive experiments comparing our model with other commonly used non-RL and benchmark MARL techniques. The evaluation results show that our proposed model, which relies only on local sensory input and a single message from neighboring intersections, excels in managing various traffic flow patterns. Furthermore, our model outperforms competing approaches in terms of robustness, resilience, and overall performance.
KW - Multi-agent Systems
KW - Reinforcement Learning
KW - Traffic Signal Control
UR - https://www.scopus.com/pages/publications/105019758517
UR - https://www.scopus.com/pages/publications/105019758517#tab=citedBy
U2 - 10.1109/ICDCS63083.2025.00073
DO - 10.1109/ICDCS63083.2025.00073
M3 - Conference contribution
AN - SCOPUS:105019758517
T3 - Proceedings - International Conference on Distributed Computing Systems
SP - 692
EP - 702
BT - Proceedings - 2025 IEEE 45th International Conference on Distributed Computing Systems, ICDCS 2025
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 20 July 2025 through 23 July 2025
ER -