A Hysteretic Q-learning Coordination Framework for Emerging Mobility Systems in Smart Cities

Behdad Chalaki, Andreas A. Malikopoulos
DOI: https://doi.org/10.23919/ECC54610.2021.9655172
2020-11-06
Abstract: Connected and automated vehicles (CAVs) can alleviate traffic congestion, reduce air pollution, and improve safety. In this paper, we provide a decentralized coordination framework for CAVs at a signal-free intersection to minimize travel time and improve fuel efficiency. We employ a simple yet powerful reinforcement learning approach, an off-policy temporal difference learning method called Q-learning, enhanced with a coordination mechanism, to address this problem. We then integrate a first-in-first-out queuing policy to improve the performance of our system. We demonstrate the efficacy of our proposed approach through simulation and comparison with the classical optimal control method based on Pontryagin's minimum principle.
Optimization and Control, Machine Learning, Multiagent Systems, Systems and Control
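
The abstract names Q-learning and the title names its hysteretic variant, but neither gives the update rule. The sketch below shows the commonly cited form of a tabular hysteretic Q-learning update, in which an agent learns faster from positive temporal-difference errors than from negative ones. It is an illustration under stated assumptions, not the authors' implementation: the function name `hysteretic_update`, the dictionary-based Q-table, and the values of `alpha`, `beta`, and `gamma` are all hypothetical, and the paper's state, action, and reward definitions for CAVs are not reproduced here.

```python
from collections import defaultdict


def hysteretic_update(q, state, action, reward, next_state, actions,
                      alpha=0.1, beta=0.01, gamma=0.9):
    """Apply one tabular hysteretic Q-learning update and return the new value.

    Assumed, illustrative parameters (not taken from the paper):
      alpha: learning rate used when the TD error is non-negative
      beta:  smaller learning rate used when the TD error is negative
      gamma: discount factor
    """
    # Off-policy target: bootstrap from the greedy action in the next state.
    best_next = max(q[(next_state, a)] for a in actions)
    td_error = reward + gamma * best_next - q[(state, action)]

    # Hysteresis: be optimistic, so penalties (possibly caused by other
    # agents' exploration) lower the estimate more slowly than rewards raise it.
    rate = alpha if td_error >= 0 else beta
    q[(state, action)] += rate * td_error
    return q[(state, action)]


# Q-table that defaults unseen (state, action) pairs to 0.0.
q_table = defaultdict(float)
actions = ["accelerate", "decelerate"]

after_reward = hysteretic_update(q_table, 0, "accelerate", 1.0, 1, actions)
after_penalty = hysteretic_update(q_table, 0, "accelerate", -1.0, 1, actions)
```

With `beta < alpha`, the second call moves the estimate down by a much smaller step than the first call moved it up; setting `beta == alpha` recovers ordinary Q-learning.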