Deep Reinforcement Learning for Robot Collision Avoidance With Self-State-Attention and Sensor Fusion

Yiheng Han, Irvin Haozhe Zhan, Wang Zhao, Jia Pan, Ziyang Zhang, Yaoyuan Wang, Yong-Jin Liu
DOI: https://doi.org/10.1109/lra.2022.3178791
IF: 5.2
2022-07-01
IEEE Robotics and Automation Letters
Abstract: 3D LiDAR sensors can provide 3D point clouds of the environment, and are widely used in automobile navigation; while 2D LiDAR sensors can only provide point cloud in a 2D sweeping plane, and then are only used for navigating robots of small height, e.g., floor mopping robots. In this letter, we propose a simple yet effective deep reinforcement learning (DRL) method with our self-state-attention unit and give a solution that can use low-cost devices (i.e., a 2D LiDAR sensor and a monocular camera) to navigate a tall mobile robot of one meter height. The overall pipeline is that we (1) infer the dense depth information of RGB images with the aid of the 2D LiDAR sensor data (i.e., point clouds in a plane with fixed height), (2) further filter the dense depth map into a 2D minimal depth data and fuse with 2D LiDAR data, and (3) make use of DRL module with our self-state-attention unit to a partially observable sequential decision making problem that can deal with partially accurate data. We present a novel DRL training scheme for robot navigation, proposing a concise and effective self-state-attention unit and proving that applying this unit can replace multi-stage training, achieve better results and generalization capability. Experiments on both simulated data and a real robot show that our method can perform efficient collision avoidance only using low-cost 2D LiDAR sensor and monocular camera.
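Step (2) of the pipeline in the abstract, filtering a dense depth map into 2D minimal depth data and fusing it with the 2D LiDAR scan, might look roughly like the sketch below. It is an illustration under stated assumptions, not the authors' filter: the function names `min_depth_profile` and `fuse_with_lidar`, the `max_range` cap, and the `fov_start` index are hypothetical, each image column is assumed to be already aligned with one LiDAR beam, and the difference between depth along the optical axis and range along a beam is ignored.

```python
import math


def min_depth_profile(depth_map, max_range=10.0):
    """Collapse an H x W depth map (rows of metres) to one minimum depth per column.

    Invalid pixels (NaN, inf, or non-positive) are ignored; a column with no
    valid pixel falls back to max_range (an assumed sensor cap).
    """
    profile = []
    for column in zip(*depth_map):  # iterate over image columns
        valid = [d for d in column if math.isfinite(d) and d > 0.0]
        profile.append(min(valid + [max_range]))
    return profile


def fuse_with_lidar(lidar_ranges, profile, fov_start):
    """Keep the nearer reading for every LiDAR beam inside the camera's field of view.

    fov_start is the (assumed) index of the first beam covered by the camera.
    """
    fused = list(lidar_ranges)
    for offset, depth in enumerate(profile):
        i = fov_start + offset
        fused[i] = min(fused[i], depth)
    return fused


# Toy example: a 4 x 3 depth map with an obstacle above the LiDAR plane
# (for instance a table top) in the middle column.
depth_map = [
    [5.0, 1.2, 5.0],
    [5.0, 5.0, 5.0],
    [5.0, 5.0, 5.0],
    [5.0, 5.0, 5.0],
]
lidar = [4.0] * 7  # the planar scan sees nothing closer than 4 m
fused = fuse_with_lidar(lidar, min_depth_profile(depth_map), fov_start=2)
print(fused)
```

Taking the minimum is a conservative choice: an obstacle that only the camera sees, because it lies above or below the scanning plane, still shortens the fused range at the matching bearing.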
robotics
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to use low-cost 2D LiDAR sensors and monocular cameras to achieve high-efficiency obstacle avoidance in robot navigation. Specifically, the paper proposes a method based on deep reinforcement learning (DRL), combined with a self-state-attention unit and sensor fusion technology, to improve the obstacle-avoidance ability of robots. This method is especially suitable for mobile robots with a relatively high height, such as the multi-functional robot shown in Figure 1.

### Main Problems

1. **Application of Low-Cost Sensors**: Existing obstacle-avoidance methods usually rely on expensive 3D LiDAR sensors, while this paper aims to use low-cost 2D LiDAR sensors and monocular cameras to achieve efficient obstacle avoidance.
2. **Decision-Making in Partially Observable Environments**: In actual environments, robots can often only obtain partially accurate data. How to make effective obstacle-avoidance decisions in such cases is a challenge.
3. **Reward Weight Adjustment**: In scenarios with different levels of complexity, it is very difficult to adjust the reward weights, which affects the generalization ability of the model.

### Solutions

1. **Depth Estimation and Sensor Fusion**:
   - Use 2D LiDAR sensor data to assist in the dense depth estimation of RGB images.
   - Compress the estimated dense depth map into 2D minimum-depth data and fuse it with 2D LiDAR data to generate enhanced depth-perception information.
2. **Self-State-Attention Unit**:
   - Introduce a self-state-attention unit to dynamically adjust the importance of input data according to the type, target position, and speed of the robot.
   - In this way, complex multi-stage training can be avoided, and the generalization ability and obstacle-avoidance effect of the model can be improved.
3. **Deep Reinforcement Learning Framework**:
   - Design a simple DRL framework based on the partially observable Markov decision process (POMDP).
   - Use the Proximal Policy Optimization (PPO) algorithm for training and optimize the loss function.

### Experimental Verification

1. **Accuracy of Depth Estimation**:
   - Conduct experiments on the NYU-Depth-v2 dataset to verify the accuracy of the proposed depth-estimation method.
   - The results show that using 2D LiDAR sensor data significantly improves the accuracy of depth estimation and is superior to existing methods.
2. **Performance of the DRL Framework**:
   - Evaluate the performance of the DRL framework in multiple test scenarios, including static scenarios, dynamic scenarios, and randomly complex obstacle scenarios.
   - The experimental results show that the proposed method performs excellently in terms of success rate and average number of time steps and has good generalization ability.

### Conclusion

The paper proposes a low-cost obstacle-avoidance method based on deep reinforcement learning and a self-state-attention unit, which successfully solves the problems of high-cost-sensor dependence and obstacle avoidance in partially observable environments. The experimental results verify the effectiveness and generalization ability of this method.
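
For readers unfamiliar with the PPO training step mentioned under Solutions, the standard clipped surrogate objective can be sketched as below. This is the generic PPO policy loss, not the paper's full loss function, which may include additional terms such as a value loss or an entropy bonus. The function name `ppo_clip_loss` is hypothetical, and `clip_eps=0.2` is a commonly used default rather than a value taken from the paper.

```python
import math


def ppo_clip_loss(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Mean clipped surrogate loss over a batch (lower is better).

    ratio = pi_new(a|s) / pi_old(a|s), computed from log-probabilities.
    The objective takes the smaller of the unclipped and clipped terms, so
    the policy gains nothing by moving the ratio outside
    [1 - clip_eps, 1 + clip_eps] in the direction the advantage favours.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        ratio = math.exp(new_lp - old_lp)
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        total += min(ratio * adv, clipped * adv)
    return -total / len(advantages)  # negate: optimizers minimize


# Two toy samples: ratio 1.5 with advantage +1, ratio 0.5 with advantage -1.
loss = ppo_clip_loss(
    new_log_probs=[math.log(1.5), math.log(0.5)],
    old_log_probs=[0.0, 0.0],
    advantages=[1.0, -1.0],
)
print(round(loss, 6))
```

In the first sample the ratio is clipped to 1.2, and in the second to 0.8, so both samples contribute the clipped term and further movement of the policy in either direction would not lower the loss.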