Learning Generalizable Robot Skills from Demonstrations in Cluttered Environments

Muhammad Asif Rana, Mustafa Mukadam, Seyed Reza Ahmadzadeh, Sonia Chernova, Byron Boots
DOI: https://doi.org/10.48550/arXiv.1808.00349
2018-08-04
Abstract: Learning from Demonstration (LfD) is a popular approach to endowing robots with skills without having to program them by hand. Typically, LfD relies on human demonstrations in clutter-free environments. This prevents the demonstrations from being affected by irrelevant objects, whose influence can obfuscate the true intention of the human or the constraints of the desired skill. However, it is unrealistic to assume that the robot's environment can always be restructured to remove clutter when capturing human demonstrations. To contend with this problem, we develop an importance weighted batch and incremental skill learning approach, building on a recent inference-based technique for skill representation and reproduction. Our approach reduces unwanted environmental influences on the learned skill, while still capturing the salient human behavior. We provide both batch and incremental versions of our approach and validate our algorithms on a 7-DOF JACO2 manipulator with reaching and placing skills.
Robotics
What problem does this paper attempt to address?
This paper addresses the problem of learning robot skills from demonstrations given in cluttered environments. Most existing Learning from Demonstration (LfD) methods assume that demonstrations are performed in clutter-free, minimally constrained environments, so that irrelevant objects do not affect the human's motion. Such influences can mask the demonstrator's true intention or the constraints of the desired skill. In practice, however, it is unrealistic to assume that the robot's environment can always be rearranged to remove clutter before demonstrations are recorded, which limits where these methods can be applied. To address this, the authors develop an importance-weighted skill learning approach that builds on a recent inference-based technique for skill representation and reproduction. The approach aims to reduce unwanted environmental influences on the learned skill while still capturing the salient human behavior. It comes in two versions, batch and incremental, and is validated on a 7-DOF JACO2 manipulator with reaching and placing skills. In short, the paper's main contribution is a method for learning generalizable robot skills from demonstrations recorded in clutter, which is relevant to deploying robots in unstructured environments that cannot be cleared before teaching.
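The summary does not spell out how importance weighting enters the learning step, so the following is only a generic illustration of the idea, not the paper's algorithm. It is a minimal sketch, assuming one-dimensional trajectories sampled at shared time steps and assuming that a per-step weight in [0, 1] is already available for each demonstration (low where clutter is believed to have distorted the motion). The names `batch_weighted_mean` and `IncrementalWeightedMean` are hypothetical. It shows that a batch estimate and an incremental, one-demonstration-at-a-time estimate of the weighted mean trajectory agree.

```python
from typing import List, Sequence


def batch_weighted_mean(demos: Sequence[Sequence[float]],
                        weights: Sequence[Sequence[float]]) -> List[float]:
    """Importance-weighted mean trajectory computed from all demos at once."""
    n_steps = len(demos[0])
    mean = []
    for t in range(n_steps):
        total_w = sum(w[t] for w in weights)
        if total_w <= 0.0:
            raise ValueError(f"no positive weight at time step {t}")
        mean.append(sum(w[t] * d[t] for d, w in zip(demos, weights)) / total_w)
    return mean


class IncrementalWeightedMean:
    """Same estimate, updated one demonstration at a time."""

    def __init__(self, n_steps: int) -> None:
        self.mean = [0.0] * n_steps
        self.total_weight = [0.0] * n_steps

    def update(self, demo: Sequence[float], weight: Sequence[float]) -> None:
        for t, (x, w) in enumerate(zip(demo, weight)):
            if w <= 0.0:
                continue  # a zero-weight sample carries no information
            self.total_weight[t] += w
            # Standard running weighted-mean update.
            self.mean[t] += (w / self.total_weight[t]) * (x - self.mean[t])


# Hypothetical data: the second demo detours at step 1 (e.g. around clutter),
# so that sample is given a low importance weight.
demos = [[0.0, 1.0, 2.0], [0.0, 3.0, 2.0]]
weights = [[1.0, 1.0, 1.0], [1.0, 0.25, 1.0]]

batch = batch_weighted_mean(demos, weights)

incremental = IncrementalWeightedMean(n_steps=3)
for demo, weight in zip(demos, weights):
    incremental.update(demo, weight)

print(batch)
print(incremental.mean)
```

With these made-up numbers the weighted mean at step 1 stays close to the undisturbed demonstration (about 1.4 instead of the unweighted 2.0). A full skill model would also need a covariance estimate and a principled way to obtain the weights, neither of which is attempted here.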