Mastering Robot Control Through Point-based Reinforcement Learning with Pre-training.
Yihong Chen,Cong Wang,Tianpei Yang,Meng Wang,Yingfeng Chen,Jifei Zhou,Chaoyi Zhao,Xinfeng Zhang,Zeng Zhao,Changjie Fan,Zhipeng Hu,Rong Xiong,Long Zeng
DOI: https://doi.org/10.5555/3635637.3663106
2024-01-01
Abstract:Visual-based Reinforcement Learning (RL) has gained prominence in robotics decision-making due to its significant potential. However, the prevalent utilization of images in visual-based RL lacks explicit descriptions of object structures and spatial configurations in scenes, thereby limiting the overall efficiency and robustness of RL in robot control. Additionally, training an RL policy solely using visual observations from scratch is typically sample-inefficient, rendering it impractical for real-world application. To address these challenges, this paper proposes a novel method, called Pre-training on Point-based RL (P2RL), which takes the point cloud representations of scenes as states and preserves the intricate spatial details between objects. To further enhance efficiency, we leverage the pre-training method to bolster the perception ability of the network. Key factors in the pre-training process are systematically examined to optimize downstream RL training. Experimental results demonstrate the superior robustness and efficiency of P2RL compared to the state-of-the-art image-based RL method, especially in evaluations involving untrained scenes.