Modular Hierarchical Reinforcement Learning for Multi-Destination Navigation in Hybrid Crowds

Wen Ou, Biao Luo, Bingchuan Wang, Yuqian Zhao
DOI: https://doi.org/10.1016/j.neunet.2023.12.032
IF: 7.8
2024-01-01
Neural Networks
Abstract: Real-world robot applications often require a navigating agent to handle multiple destinations. Moreover, real-world crowded environments usually contain both dynamic and static crowds that implicitly interact with each other during navigation. To address this challenging task, a novel modular hierarchical reinforcement learning (MHRL) method is developed in this paper. MHRL is composed of three modules, i.e., destination evaluation, policy switch, and motion network, which correspond to the three phases of solving the original navigation problem. First, the destination evaluation module rates all destinations and selects the one with the lowest cost. Subsequently, the policy switch module decides which motion network to use according to the selected destination and the obstacle state. Finally, the selected motion network outputs the robot action. Owing to the complementary strengths of a variety of motion networks and the cooperation of modules in each layer, MHRL is able to deal with hybrid crowds effectively. Extensive simulation experiments demonstrate that MHRL achieves better performance than state-of-the-art methods.
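The three-phase flow described in the abstract (rate destinations, pick a motion network, output an action) can be illustrated with a minimal Python sketch. This is not the authors' implementation. Every name here (`evaluate_destinations`, `switch_policy`, `head_toward`, `MOTION_NETWORKS`, `mhrl_step`) is hypothetical, the straight-line distance cost stands in for the paper's destination evaluation, the proximity rule stands in for its policy switch, and the hand-written policies stand in for trained motion networks.

```python
import math
from typing import Callable, Dict, List, Tuple

Point = Tuple[float, float]
Action = Tuple[float, float]          # (vx, vy) velocity command
Obstacle = Tuple[Point, bool]         # (position, is_moving)


def evaluate_destinations(robot: Point, destinations: List[Point]) -> Point:
    """Phase 1: rate every destination and return the lowest-cost one.

    Assumption: cost is straight-line distance. The abstract does not
    state the actual cost used by the paper.
    """
    return min(destinations, key=lambda d: math.dist(robot, d))


def switch_policy(robot: Point, destination: Point,
                  obstacles: List[Obstacle], near: float = 2.0) -> str:
    """Phase 2: choose a motion network from the destination and obstacle state.

    Assumption: use the "dynamic" policy when a moving obstacle is within
    `near` of the robot and closer than the destination; otherwise "static".
    """
    to_goal = math.dist(robot, destination)
    for position, is_moving in obstacles:
        gap = math.dist(robot, position)
        if is_moving and gap < near and gap < to_goal:
            return "dynamic"
    return "static"


def head_toward(speed: float) -> Callable[[Point, Point], Action]:
    """Stand-in for a trained motion network: move straight at `speed`."""
    def policy(robot: Point, destination: Point) -> Action:
        dx, dy = destination[0] - robot[0], destination[1] - robot[1]
        norm = math.hypot(dx, dy) or 1.0   # avoid division by zero at the goal
        return (speed * dx / norm, speed * dy / norm)
    return policy


# Hypothetical registry of motion networks, keyed by the switch output.
MOTION_NETWORKS: Dict[str, Callable[[Point, Point], Action]] = {
    "static": head_toward(1.0),
    "dynamic": head_toward(0.5),   # move more cautiously near moving people
}


def mhrl_step(robot: Point, destinations: List[Point],
              obstacles: List[Obstacle]) -> Tuple[Point, str, Action]:
    """Run the three phases once and return (destination, policy name, action)."""
    destination = evaluate_destinations(robot, destinations)
    name = switch_policy(robot, destination, obstacles)
    action = MOTION_NETWORKS[name](robot, destination)   # Phase 3
    return destination, name, action
```

Calling `mhrl_step((0.0, 0.0), [(3.0, 4.0), (1.0, 0.0)], [((0.5, 0.0), True)])` selects the nearer destination and the cautious policy under these assumed rules. The point of the sketch is only the modular structure: each phase can be replaced independently without touching the others.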