The Role of Higher-Order Cognitive Models in Active Learning

Oskar Keurulainen, Gokhan Alcan, Ville Kyrki
2024-01-09
Abstract: Building machines capable of efficiently collaborating with humans has been a longstanding goal in artificial intelligence. Especially in the presence of uncertainties, optimal cooperation often requires that humans and artificial agents model each other's behavior and use these models to infer underlying goals, beliefs or intentions, potentially involving multiple levels of recursion. Empirical evidence for such higher-order cognition in human behavior is also provided by previous works in cognitive science, linguistics, and robotics. We advocate for a new paradigm for active learning for human feedback that utilises humans as active data sources while accounting for their higher levels of agency. In particular, we discuss how an increasing level of agency results in qualitatively different forms of rational communication between an active learning system and a teacher. Additionally, we provide a practical example of active learning using a higher-order cognitive model. This is accompanied by a computational study that underscores the unique behaviors that this model produces.
Machine Learning, Robotics
What problem does this paper attempt to address?
This paper addresses how higher-order cognitive models can make active learning from human feedback more effective. Specifically, it explores how modeling humans' multi-level reasoning (such as their ability to infer the goals, beliefs, or intentions of others, including those of the learning system itself) lets a machine interact with a human teacher more effectively and make more efficient use of the information the teacher provides. The core contribution is a proposed active learning paradigm that treats humans not merely as a data source but as agents with higher levels of agency. The authors argue that this yields qualitatively different, higher-quality bidirectional communication between the learner and the teacher, which in turn can improve the performance of machine learning systems. The paper illustrates the approach with a practical example of active learning using a higher-order cognitive model, supported by a computational study of the behaviors this model produces, and discusses directions for future research.
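To make "multiple levels of recursion" concrete, here is a minimal Python sketch of recursive teacher-learner reasoning. It is not the model from the paper. It assumes a generic pedagogical-reasoning setup: a uniform prior over hypotheses, a hypothetical `consistent` matrix saying which data items are compatible with which hypothesis, and a softmax-style teacher with a hypothetical rationality parameter `beta`. All function names are illustrative.

```python
import numpy as np

def normalize(m, axis):
    """Normalize m along `axis` so slices sum to 1 (all-zero slices stay zero)."""
    s = m.sum(axis=axis, keepdims=True)
    return m / np.where(s == 0, 1.0, s)

def literal_learner(consistent):
    """Level-0 learner: P(h | d) with a uniform prior and a 0/1 consistency likelihood."""
    return normalize(consistent.astype(float), axis=0)

def teacher(learner_posterior, beta=1.0):
    """Teacher who picks data in proportion to how well it conveys h: P(d | h) ∝ P_L(h | d)^beta."""
    return normalize(learner_posterior ** beta, axis=1)

def higher_order_learner(consistent, levels=1, beta=1.0):
    """Learner that assumes the teacher reasons about a learner, nested `levels` times."""
    posterior = literal_learner(consistent)
    for _ in range(levels):
        likelihood = teacher(posterior, beta)      # P(d | h) under the modeled teacher
        posterior = normalize(likelihood, axis=0)  # Bayes' rule with a uniform prior
    return posterior

# Assumed toy problem: rows are hypotheses h0, h1; columns are data items d0, d1.
# h0 is consistent with d0 only; h1 is consistent with both d0 and d1.
consistent = np.array([[1, 0],
                       [1, 1]])

level0 = literal_learner(consistent)
level1 = higher_order_learner(consistent, levels=1)
print("P(h0 | d0), literal learner:", level0[0, 0])
print("P(h0 | d0), level-1 learner:", level1[0, 0])
```

In this toy problem the literal learner cannot tell the hypotheses apart after seeing `d0`, while the level-1 learner leans toward `h0`: a teacher who meant `h1` would more likely have shown the unambiguous `d1`. That shift illustrates how adding a level of modeled agency changes what the same feedback communicates.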