How Can Everyday Users Efficiently Teach Robots by Demonstrations?

Maram Sakr,Zhikai Zhang,Benjamin Li,Haomiao Zhang,H.F. Machiel Van der Loos,Dana Kulic,Elizabeth Croft
2023-10-20
Abstract:Learning from Demonstration (LfD) is a framework that allows lay users to easily program robots. However, the efficiency of robot learning and the robot's ability to generalize to task variations hinges upon the quality and quantity of the provided demonstrations. Our objective is to guide human teachers to furnish more effective demonstrations, thus facilitating efficient robot learning. To achieve this, we propose to use a measure of uncertainty, namely task-related information entropy, as a criterion for suggesting informative demonstration examples to human teachers to improve their teaching skills. In a conducted experiment (N=24), an augmented reality (AR)-based guidance system was employed to train novice users to produce additional demonstrations from areas with the highest entropy within the workspace. These novice users were trained for a few trials to teach the robot a generalizable task using a limited number of demonstrations. Subsequently, the users' performance after training was assessed first on the same task (retention) and then on a novel task (transfer) without guidance. The results indicated a substantial improvement in robot learning efficiency from the teacher's demonstrations, with an improvement of up to 198% observed on the novel task. Furthermore, the proposed approach was compared to a state-of-the-art heuristic rule and found to improve robot learning efficiency by 210% compared to the heuristic rule.
Robotics,Machine Learning
What problem does this paper attempt to address?
### Problems Addressed by the Paper This paper aims to address the issue of how to improve the efficiency and effectiveness of ordinary users in teaching robots tasks through Learning from Demonstration (LfD). Specifically, the paper focuses on the following aspects: 1. **Improving Robot Learning Efficiency**: The LfD framework allows non-expert users to easily program robots, but the learning efficiency and generalization ability of robots depend on the quality and quantity of the demonstrations provided. The goal of the paper is to guide human teachers to provide more effective demonstrations, thereby improving the robot's learning efficiency. 2. **Optimizing Demonstration Selection**: To achieve this goal, the paper proposes using task-related information entropy as a criterion, suggesting that human teachers provide the most informative demonstration examples to improve their teaching skills. 3. **Enhancing User Teaching Ability**: Through experimental validation, the paper uses an Augmented Reality (AR) system to train novice users to provide additional demonstrations, especially in areas of the workspace with the highest uncertainty. After several training sessions, these users are able to teach the robot a generalizable task with a limited number of demonstrations. 4. **Evaluating User Performance**: In the experiments, user performance was evaluated on the same task (retention test) and new tasks (transfer test). The results showed that users employing the information entropy method improved their performance on new tasks by 198%, and the robot's learning efficiency increased by 210% compared to existing heuristic rules. ### Main Contributions - **Information Entropy as a Guiding Criterion**: Introducing task-related information entropy as a criterion for selecting demonstrations to improve the robot's learning efficiency and generalization ability. - **Application of AR System**: Using an AR system to help novice users identify and provide the most informative demonstrations, thereby optimizing the teaching process. - **Experimental Validation**: Experimentally validating the effectiveness of the proposed method, particularly the significant improvement on new tasks. ### Conclusion By introducing information entropy as a criterion for selecting demonstrations and combining it with AR technology, the paper successfully improves the efficiency and effectiveness of ordinary users in teaching robots tasks. This not only helps to enhance the robot's learning efficiency but also provides new ideas and methods for future robot teaching.