RL-DWA Omnidirectional Motion Planning for Person Following in Domestic Assistance and Monitoring

Andrea Eirale,Mauro Martini,Marcello Chiaberge
DOI: https://doi.org/10.48550/arXiv.2211.04993
2023-01-13
Abstract:Robot assistants are emerging as high-tech solutions to support people in everyday life. Following and assisting the user in the domestic environment requires flexible mobility to safely move in cluttered spaces. We introduce a new approach to person following for assistance and monitoring. Our methodology exploits an omnidirectional robotic platform to detach the computation of linear and angular velocities and navigate within the domestic environment without losing track of the assisted person. While linear velocities are managed by a conventional Dynamic Window Approach (DWA) local planner, we trained a Deep Reinforcement Learning (DRL) agent to predict optimized angular velocities commands and maintain the orientation of the robot towards the user. We evaluate our navigation system on a real omnidirectional platform in various indoor scenarios, demonstrating the competitive advantage of our solution compared to a standard differential steering following.
Robotics,Artificial Intelligence
What problem does this paper attempt to address?
This paper aims to solve the problem of how robots can flexibly and safely follow and assist users in a home environment. Specifically, the author proposes a new human - following method for an omnidirectional mobile platform to achieve: 1. **Flexible mobility**: Move safely in a cluttered home environment, avoid obstacles, and not lose track of the user at the same time. 2. **Continuous visual monitoring**: By optimizing the direction control of the robot, ensure that the robot always faces the user, thereby maintaining continuous visual monitoring of the user. To achieve these goals, the author adopts the following technical means: - **Dynamic Window Approach (DWA)**: Used to generate linear velocity commands \([v_x, v_y]\) to ensure that the robot can move along a safe path. - **Deep Reinforcement Learning (DRL)**: Train an agent to predict the optimized angular velocity command \(\omega\) to maintain the direction of the robot towards the user. - **Real - time perception pipeline**: A real - time perception system based on PoseNet for detecting and tracking the user's position. Through the combination of these technologies, the author hopes to achieve more efficient and safer human - following tasks in a complex home environment, especially in cases where obstacle avoidance and continuous user monitoring are required. Experimental results show that compared with traditional differential - drive navigation, this method shows significant advantages in maintaining user tracking.