Closed-loop Teaching via Demonstrations to Improve Policy Transparency

Michael S. Lee,Reid Simmons,Henny Admoni
2024-04-01
Abstract:Demonstrations are a powerful way of increasing the transparency of AI policies. Though informative demonstrations may be selected a priori through the machine teaching paradigm, student learning may deviate from the preselected curriculum in situ. This paper thus explores augmenting a curriculum with a closed-loop teaching framework inspired by principles from the education literature, such as the zone of proximal development and the testing effect. We utilize tests accordingly to close to the loop and maintain a novel particle filter model of human beliefs throughout the learning process, allowing us to provide demonstrations that are targeted to the human's current understanding in real time. A user study finds that our proposed closed-loop teaching framework reduces the regret in human test responses by 43% over a baseline.
Computers and Society,Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: **How to improve the transparency of AI strategies through demonstrations and ensure that learners (humans) can better understand and predict the behavior of AI**. ### Specific problem description 1. **Limitations of existing methods**: - Although complex and efficient AI strategies can be trained through reinforcement learning, ensuring the transparency (i.e., comprehensibility and predictability) of these strategies in all scenarios remains a challenge. - Although traditional machine - teaching methods can select an optimal set of demonstrations to help students understand AI strategies, in the actual learning process, students' learning trajectories may deviate from the pre - designed teaching paths. For example, students may not be able to understand certain concepts in a timely manner, resulting in poor subsequent learning effects. 2. **The need to improve transparency**: - Improving the transparency of AI strategies is crucial for calibrating the expectations of developers and end - users to ensure that they can use AI systems correctly. - Existing methods are usually tested after providing demonstrations and fail to adjust the teaching content in real - time to adapt to the current understanding level of students. ### The method proposed in the paper To overcome the above problems, this paper proposes a **closed - loop teaching framework**, which combines principles in pedagogy (such as the zone of proximal development and the testing effect) to improve the transparency of AI strategies in the following ways: - **Real - time adjustment of teaching content**: Use testing and feedback mechanisms to dynamically adjust the demonstration content during the student's learning process, ensuring that each demonstration is in line with the student's current understanding level. - **Maintaining the human belief model**: Use a particle filter model to update the understanding of students' beliefs in real - time, thereby selecting the most appropriate demonstration content. - **Teaching for individual differences**: Considering that different students have different learning rates, evaluate students' learning progress through regular tests and provide additional guidance as needed. ### Main contributions 1. **Closed - loop teaching framework**: Combining pedagogical principles, providing a closed - loop teaching method that realizes real - time adjustment of teaching content through demonstrations, tests, and feedback. 2. **Particle filter model**: A human belief model that supports iterative update and calibration, which can efficiently estimate students' understanding of AI strategies. 3. **User study verification**: The effectiveness of this framework has been verified through user studies, and the results show that the closed - loop teaching framework can reduce the regret of human test responses by 43%. In conclusion, this paper aims to significantly improve the transparency of AI strategies and human understanding of their behavior by introducing a closed - loop teaching framework and combining effective teaching strategies in pedagogy.