Interaction Prediction and Monte-Carlo Tree Search for Robot Manipulation in Clutter

Baichuan Huang,Shuai D. Han,Abdeslam Boularias,Jingjin Yu
2021-01-01
Abstract:We considered two related robot manipulation problems: clutter removal and object retrieval in clutter, while emphasizing action efficiency and exploring synergy between pushing and grasping. For clutter removal, we propose a Deep Interaction Prediction Network (DIPN) for predicting interactions between robot gripper and objects. Together with a grasp network for predicting grasping success, DIPN generates intelligent pushes for solving clutter removal problem efficiently. For the second, more complex object retrieval problem, we combine DIPN and Monte-Carlo Tree Search to generate smart multi-step push predictions that yields the best target object pose for grasping. We call this fusion of methods the Visual Foresight Tree (VFT). Experiments in simulation and using real robot show that our methods significantly outperform the previous state-of-the-art, and produce human-like performance. More detail can be found at https://arxiv.org/pdf/2011.04692.pdf and https://arxiv.org/pdf/2105.02857.pdf
What problem does this paper attempt to address?