Maximum Entropy Inverse Reinforcement Learning-Based Trajectory Planning for Autonomous Driving

Peng Zhang,Sihong Xie,Xing Lv,Zhihui Zhong,Qing Li
DOI: https://doi.org/10.1117/12.3026671
2024-01-01
Abstract:With the rapid advancement of autonomous driving technology, effective trajectory planning has become crucial for ensuring road safety and driving efficiency. Traditional trajectory planning methods often rely on preset rules and models, making them ill-suited for the complex and dynamic traffic environment. To address this, a Maximum Entropy Inverse Reinforcement Learning (MaxEnt IRL) based trajectory planning method is proposed in this paper, aiming at learning from expert driving behaviors to infer an efficient reward function, which in turn guides decision-making and path planning for autonomous vehicles. This study begins by analyzing expert driving data to extract key state and action features. Then, the MaxEnt IRL algorithm is applied to learn the reward function underlying these features, reflecting the decision-making logic of expert drivers. The learned reward function is subsequently used to guide the trajectory planning of the autonomous driving system, generating safe and efficient driving paths. A series of experiments conducted in a simulated environment demonstrate that the MaxEnt IRL-based method proposed in this paper exhibits higher adaptability and efficiency in handling complex traffic scenarios compared to traditional trajectory planning methods.
What problem does this paper attempt to address?