An Indirect Reinforcement Learning Approach For Ramp Control Under Incident-Induced Congestion

Chao Lu,Haibo Chen,Susan Grant-Muller
DOI: https://doi.org/10.1109/ITSC.2013.6728359
2013-01-01
Abstract:Incident-induced congestion is one of the main causes for delays on motorways. Strategies for managing such congestion using traffic control technologies can be classified into model-based and model-free methods. Both methods possess their own merits but also have drawbacks. Dyna-Q architecture is a method that can combine model-free learning and model-based planning together to obtain the benefits from both sides. Based on the Dyna-Q architecture, an indirect reinforcement learning (IRL) approach is derived in this study. The new method is compared with two other methods, namely DRL and ALINEA. Simulation experiment results show that, with suitable weight values, IRL can achieve a superior performance in many scenarios. Moreover, compared with DRL, IRL has a much faster learning speed.
What problem does this paper attempt to address?