Learning for Long-Horizon Planning via Neuro-Symbolic Abductive Imitation

Jie-Jing Shao, Hao-Ran Hao, Xiao-Wen Yang, Yu-Feng Li
2024-11-27
Abstract: Recent learning-to-imitation methods have shown promising results in planning by imitating within the observation-action space. However, their ability in open environments remains constrained, particularly in long-horizon tasks. In contrast, traditional symbolic planning excels in long-horizon tasks through logical reasoning over human-defined symbolic spaces but struggles to handle observations beyond symbolic states, such as high-dimensional visual inputs encountered in real-world scenarios. In this work, we draw inspiration from abductive learning and introduce a novel framework, ABductive Imitation Learning (ABIL), that integrates the benefits of data-driven learning and symbolic reasoning, enabling long-horizon planning. Specifically, we employ abductive reasoning to understand the demonstrations in symbolic space and design the principles of sequential consistency to resolve the conflicts between perception and reasoning. ABIL generates predicate candidates to facilitate perception from raw observations to symbolic space without laborious predicate annotations, providing a groundwork for symbolic planning. With the symbolic understanding, we further develop a policy ensemble whose base policies are built with different logical objectives and managed through symbolic reasoning. Experiments show that our proposal successfully understands the observations with task-relevant symbols to assist the imitation learning. Importantly, ABIL demonstrates significantly improved data efficiency and generalization across various long-horizon tasks, highlighting it as a promising solution for long-horizon planning. Project website: https://www.lamda.nju.edu.cn/shaojj/KDD25_ABIL/
Machine Learning, Artificial Intelligence
What problem does this paper attempt to address?
The paper addresses how to combine data-driven learning with symbolic reasoning so that imitation learning can handle long-horizon planning tasks in open environments. Current imitation learning methods achieve good results by imitating within the observation-action space, but their performance in open environments, especially on long-horizon tasks, remains limited. Traditional symbolic planning handles long-horizon tasks well through logical reasoning, but it has difficulty with observations beyond symbolic states, such as the high-dimensional visual inputs encountered in real-world scenarios. The paper therefore proposes a new framework, Abductive Imitation Learning (ABIL), which aims to achieve long-horizon planning by combining the advantages of both approaches.

The main contributions of the ABIL framework are:

1. **Abductive reasoning**: Abduction is used to understand the demonstrations in symbolic space, and principles of sequential consistency are designed to resolve conflicts between perception and reasoning.
2. **Predicate candidate generation**: Symbolic information is extracted from raw observations without laborious predicate annotations, providing a basis for symbolic planning.
3. **Policy ensemble**: The base policies are built with different logical objectives and managed through symbolic reasoning. They learn specific behaviors directly from demonstrations, reducing the dependence on pre-existing low-level controllers.

The experimental results show that ABIL understands the task-relevant symbolic information in observations and uses it to assist imitation learning, and that it significantly improves data efficiency and generalization across various long-horizon tasks. This indicates that ABIL is a promising solution to the long-horizon planning problem.
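
To make the idea of base policies managed by symbolic reasoning more concrete, here is a minimal toy sketch in Python. It is not the authors' implementation: the predicates (`has_key`, `door_open`), the threshold-based `perceive` function, the fixed subgoal order in `PLAN`, and the action names are all hypothetical stand-ins chosen for illustration. In ABIL the perception and base policies would be learned from demonstrations rather than hand-written.

```python
from typing import Callable, Dict, List, Optional, Sequence

Observation = Sequence[float]
Action = str

# Hypothetical ordered subgoals; a symbolic planner would normally produce this.
PLAN: List[str] = ["has_key", "door_open"]


def perceive(obs: Observation) -> Dict[str, bool]:
    """Hypothetical perception: map a raw observation to predicate truth values.

    A learned model would go here; thresholds are used only to keep the
    sketch self-contained.
    """
    return {"has_key": obs[0] > 0.5, "door_open": obs[1] > 0.5}


def next_subgoal(predicates: Dict[str, bool], plan: List[str]) -> Optional[str]:
    """Return the first subgoal in the plan that is not yet satisfied."""
    for goal in plan:
        if not predicates[goal]:
            return goal
    return None


# One base policy per logical objective (hypothetical, hand-written here).
BASE_POLICIES: Dict[str, Callable[[Observation], Action]] = {
    "has_key": lambda obs: "move_to_key",
    "door_open": lambda obs: "move_to_door",
}


def act(obs: Observation) -> Action:
    """Select the base policy that matches the current symbolic subgoal."""
    predicates = perceive(obs)
    goal = next_subgoal(predicates, PLAN)
    if goal is None:
        return "done"
    return BASE_POLICIES[goal](obs)
```

The design point the sketch illustrates is the division of labor: the symbolic layer decides which objective is active, and only the corresponding base policy maps the raw observation to an action.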