Abstract: Learning from Demonstration (LfD) is a powerful type of machine learning that can allow novices to teach and program robots to complete various tasks. However, the learning process for these systems may still be difficult for novices to interpret and understand, making effective teaching challenging. Explainable artificial intelligence (XAI) aims to address this challenge by explaining a system to the user. In this work, we investigate XAI within LfD by implementing an adaptive explanatory feedback system on an inverse reinforcement learning (IRL) algorithm. The feedback is implemented by demonstrating selected learnt trajectories to users. The system adapts to user teaching by categorizing and then selectively sampling trajectories shown to a user, to show a representative sample of both successful and unsuccessful trajectories. The system was evaluated through a user study with 26 participants teaching a robot a navigation task. The results of the user study demonstrated that the proposed explanatory feedback system can improve robot performance, teaching efficiency and user understanding of the robot.
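The "categorize, then selectively sample" step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `Trajectory` type, the `sample_feedback` function, and the proportional allocation rule (with at least one trajectory from each non-empty category) are all assumptions made for illustration.

```python
import random
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Trajectory:
    """Hypothetical record of one learnt trajectory (not from the paper)."""
    start: Tuple[int, int]   # grid cell the rollout started from
    reached_goal: bool       # True if the rollout ended at a target location


def sample_feedback(trajectories: List[Trajectory], k: int,
                    rng: random.Random) -> List[Trajectory]:
    """Pick up to k trajectories covering both successes and failures.

    Assumed rule: allocate slots in proportion to category size, but keep
    at least one slot per non-empty category when k >= 2.
    """
    if not trajectories or k <= 0:
        return []
    k = min(k, len(trajectories))

    # Step 1: categorize by outcome.
    successes = [t for t in trajectories if t.reached_goal]
    failures = [t for t in trajectories if not t.reached_goal]

    # Step 2: decide how many slots each category gets.
    n_fail = round(k * len(failures) / len(trajectories))
    if successes and failures and k >= 2:
        n_fail = max(1, min(k - 1, n_fail))
    n_fail = min(n_fail, len(failures))
    n_succ = min(k - n_fail, len(successes))

    # Step 3: sample without replacement inside each category.
    return rng.sample(failures, n_fail) + rng.sample(successes, n_succ)
```

With eight successful and two unsuccessful trajectories and `k=5`, this rule would show four successes and one failure, so the user sees where the learnt policy still breaks down without the failures being drowned out or overrepresented.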

What problem does this paper attempt to address?

### Problems the Paper Attempts to Solve

This paper addresses the difficulty novice users have in understanding and interpreting the robot learning process in Learning from Demonstration (LfD). LfD is a powerful machine learning method that allows non-expert users to teach robots to perform various tasks through demonstrations. However, because learning algorithms behave as black boxes, novice users may not understand the robot's learning strategies, which makes their teaching less effective. To solve this problem, the paper applies Explainable Artificial Intelligence (XAI), providing adaptive explanatory feedback that helps users better understand the robot's behavior and thereby improves teaching efficiency and robot performance.

### Main Contributions

1. **Proposed an adaptive explanatory feedback system based on demonstrations**: The system automatically generates explanatory trajectories and samples them adaptively according to the user's teaching, showing both successful and failed task executions.
2. **Provided the first evaluation of the impact of XAI on user understanding in LfD algorithms**: A user study tested whether XAI improves users' understanding of robot learning strategies.
3. **Demonstrated the positive impact of explanatory feedback on user teaching and understanding**: Experimental results show that explanatory feedback improves both users' understanding of robot behavior and the robot's learning performance.

### Experimental Design

- **Task Setup**: Users teach a robot a navigation task in a grid world environment, with the goal of having the robot navigate from any state in the environment to a set of target locations.
- **Experimental Conditions**: Participants were divided into two condition groups: the Explanatory Feedback group (EF) and the No Feedback group (NF). The EF group received explanatory trajectories after each demonstration, while the NF group only saw explanations after the task was completed.
- **Evaluation Metrics**: These included robot performance, teaching efficiency, user prediction accuracy and certainty, and user perception (such as satisfaction, understanding, teaching performance, and cognitive load).

### Results

- **Performance Metrics**: Robot performance and teaching efficiency were significantly higher in the EF group than in the NF group. All participants in the EF group achieved 100% robot performance, while 6 participants in the NF group failed to reach 100% performance after 10 demonstrations.
- **Prediction Metrics**: Users in the EF group predicted goals and actions significantly more accurately, although their prediction certainty did not increase significantly. They also made their predictions faster.
- **Perception Metrics**: Explanatory feedback did not significantly affect user perception (such as satisfaction, understanding, teaching performance, and cognitive load); visually, user perception in the EF group was similar to that in the NF group. The EF group's advantages were in robot performance, teaching efficiency, and user understanding.

### Discussion

- **Performance Improvement**: Explanatory feedback helps users identify areas that need teaching, so they provide more effective demonstrations and avoid or reduce performance stagnation.
- **User Understanding**: Explanatory feedback helps users better understand the robot's learning process, thereby improving teaching efficiency and robot performance.

In summary, this paper addresses the problem of novice users struggling to understand robot learning strategies in LfD by introducing an adaptive explanatory feedback system, which improves teaching efficiency and robot performance.
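The summary reports "100% robot performance" without defining the metric. One plausible reading, given that the task is to navigate from any state to a set of target locations, is the percentage of start states from which the learnt policy reaches a target. The sketch below assumes that definition, a deterministic policy, and four-way moves; the names `MOVES`, `reaches_goal`, and `robot_performance` are hypothetical.

```python
from typing import Dict, Set, Tuple

State = Tuple[int, int]  # (row, col) cell in the grid world

# Assumed action set: four-way moves expressed as (row, col) offsets.
MOVES: Dict[str, Tuple[int, int]] = {
    "up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1),
}


def reaches_goal(start: State, policy: Dict[State, str], goals: Set[State],
                 rows: int, cols: int, max_steps: int) -> bool:
    """Roll out a deterministic policy and report whether it hits a goal."""
    state = start
    for _ in range(max_steps):
        if state in goals:
            return True
        d_row, d_col = MOVES[policy[state]]
        nxt = (state[0] + d_row, state[1] + d_col)
        # Moving into a wall leaves the robot where it is.
        if 0 <= nxt[0] < rows and 0 <= nxt[1] < cols:
            state = nxt
    return state in goals


def robot_performance(policy: Dict[State, str], goals: Set[State],
                      rows: int, cols: int) -> float:
    """Percentage of start states from which the policy reaches a goal."""
    states = [(r, c) for r in range(rows) for c in range(cols)]
    # A loop-free path visits each cell at most once, so rows * cols
    # steps are enough for any rollout that can succeed.
    reached = sum(
        reaches_goal(s, policy, goals, rows, cols, rows * cols)
        for s in states
    )
    return 100.0 * reached / len(states)
```

Under this reading, a participant reaches 100% only once the learnt policy succeeds from every cell, which is consistent with the reported result that some NF participants were still short of 100% after 10 demonstrations.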

Demonstration Based Explainable AI for Learning from Demonstration Methods
