Learning for Long-Horizon Planning via Neuro-Symbolic Abductive Imitation

Jie-Jing Shao, Hao-Ran Hao, Xiao-Wen Yang, Yu-Feng Li

2024-11-27

Abstract: Recent learning-to-imitation methods have shown promising results in planning via imitating within the observation-action space. However, their ability in open environments remains constrained, particularly in long-horizon tasks. In contrast, traditional symbolic planning excels in long-horizon tasks through logical reasoning over human-defined symbolic spaces but struggles to handle observations beyond symbolic states, such as high-dimensional visual inputs encountered in real-world scenarios. In this work, we draw inspiration from abductive learning and introduce a novel framework ABductive Imitation Learning (ABIL) that integrates the benefits of data-driven learning and symbolic-based reasoning, enabling long-horizon planning. Specifically, we employ abductive reasoning to understand the demonstrations in symbolic space and design the principles of sequential consistency to resolve the conflicts between perception and reasoning. ABIL generates predicate candidates to facilitate the perception from raw observations to symbolic space without laborious predicate annotations, providing a groundwork for symbolic planning. With the symbolic understanding, we further develop a policy ensemble whose base policies are built with different logical objectives and managed through symbolic reasoning. Experiments show that our proposal successfully understands the observations with the task-relevant symbolics to assist the imitation learning. Importantly, ABIL demonstrates significantly improved data efficiency and generalization across various long-horizon tasks, highlighting it as a promising solution for long-horizon planning. Project website: https://www.lamda.nju.edu.cn/shaojj/KDD25_ABIL/

Machine Learning, Artificial Intelligence

What problem does this paper attempt to address?

This paper asks how to combine data-driven learning with symbol-based reasoning so that imitation learning can handle long-horizon planning tasks in open environments. Current imitation learning methods achieve good results by imitating within the observation-action space, but their performance in open environments, especially on long-horizon tasks, remains limited. Traditional symbolic planning handles long-horizon tasks well through logical reasoning, but it has difficulty with observations beyond symbolic states, such as the high-dimensional visual inputs encountered in real-world scenarios. The paper therefore proposes a new framework, Abductive Imitation Learning (ABIL), which aims to achieve long-horizon planning by combining the advantages of data-driven learning and symbolic reasoning.

The main contributions of the ABIL framework are:

1. **Introduction of abductive reasoning**: Abductive reasoning is used to understand the demonstrations in the symbolic space, and principles of sequential consistency are designed to resolve conflicts between perception and reasoning.
2. **Generation of predicate candidates**: Symbolic information is extracted from raw observations without laborious predicate annotations, providing a basis for symbolic planning.
3. **Development of a policy ensemble**: The base policies are built with different logical objectives and managed through symbolic reasoning. They learn specific behaviors directly from demonstrations, reducing dependence on pre-existing low-level controllers.

The experimental results show that ABIL understands task-relevant symbolic information from observations and uses it to assist imitation learning, and that it significantly improves data efficiency and generalization across various long-horizon tasks. This indicates that ABIL is a promising solution to the long-horizon planning problem.
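To make the first and third contributions more concrete, here is a minimal Python sketch. It is not the authors' implementation. It assumes a hypothetical perception model that outputs, for each demonstration step, a probability over the ordered stages of a known symbolic plan. As a stand-in for abduction under sequential consistency, it uses a simple dynamic program that picks the most likely stage labels under the constraint that stages never move backward. The names `abduce_stage_labels`, `select_policy`, and the policy names are illustrative assumptions.

```python
import math
from typing import Callable, Dict, List, Sequence


def abduce_stage_labels(stage_probs: Sequence[Sequence[float]]) -> List[int]:
    """Most likely stage per timestep under a monotone-order constraint.

    stage_probs[t][k] is the (hypothetical) perception probability that
    observation t belongs to stage k of the symbolic plan. The labeling must
    start at stage 0, end at the last stage, and at each step either stay in
    the current stage or advance by exactly one.
    """
    T, K = len(stage_probs), len(stage_probs[0])
    NEG, eps = float("-inf"), 1e-12
    score = [[NEG] * K for _ in range(T)]  # best log-probability ending at (t, k)
    back = [[0] * K for _ in range(T)]     # predecessor stage for backtracking
    score[0][0] = math.log(stage_probs[0][0] + eps)
    for t in range(1, T):
        for k in range(K):
            stay = score[t - 1][k]
            advance = score[t - 1][k - 1] if k > 0 else NEG
            prev_score, prev_k = (stay, k) if stay >= advance else (advance, k - 1)
            if prev_score > NEG:
                score[t][k] = prev_score + math.log(stage_probs[t][k] + eps)
                back[t][k] = prev_k
    if score[T - 1][K - 1] == NEG:
        raise ValueError("demonstration is too short to cover every stage")
    # Walk the back-pointers from the final stage to recover the labeling.
    labels = [0] * T
    k = K - 1
    for t in range(T - 1, -1, -1):
        labels[t] = k
        k = back[t][k]
    return labels


def select_policy(stage: int, policies: Dict[int, Callable[[object], str]]):
    """Pick the base policy assigned to the current symbolic stage."""
    return policies[stage]


# Perception is noisy at step 3: its argmax jumps back to stage 0.
probs = [
    [0.9, 0.1, 0.0],
    [0.8, 0.2, 0.0],
    [0.2, 0.7, 0.1],
    [0.6, 0.3, 0.1],
    [0.1, 0.2, 0.7],
]
greedy = [max(range(3), key=lambda k: row[k]) for row in probs]
labels = abduce_stage_labels(probs)

# Hypothetical base policies, one per stage.
policies = {
    0: lambda obs: "reach_key",
    1: lambda obs: "open_door",
    2: lambda obs: "go_to_goal",
}
action = select_policy(labels[3], policies)(None)
```

In this toy run the per-step argmax gives `[0, 0, 1, 0, 2]`, which contradicts the plan order, while the constrained labeling gives `[0, 0, 1, 1, 2]`. The corrected labels could then serve as pseudo-labels for training the perception model and for routing each observation to a stage-specific base policy.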


Learning adaptive planning representations with natural language guidance

Hierarchical Reinforcement Learning with Abductive Planning

Embodied Active Learning of Relational State Abstractions for Bilevel Planning

Imperative Learning: A Self-supervised Neural-Symbolic Learning Framework for Robot Autonomy

VisualPredicator: Learning Abstract World Models with Neuro-Symbolic Predicates for Robot Planning

Improving Long-Horizon Imitation Through Instruction Prediction

Planning to Learn: A Novel Algorithm for Active Learning during Model-Based Planning

Learning Temporally Extended Skills in Continuous Domains as Symbolic Actions for Planning

Learning Planning Abstractions from Language

A framework for neurosymbolic robot action planning using large language models

Learning Neuro-Symbolic Relational Transition Models for Bilevel Planning

Synthesizing Evolving Symbolic Representations for Autonomous Systems

Deep Imitative Models for Flexible Inference, Planning, and Control

Predicate Invention for Bilevel Planning

Active Learning of Abstract Plan Feasibility

Classical Planning in Deep Latent Space: Bridging the Subsymbolic-Symbolic Boundary

LIRL: Latent Imagination-Based Reinforcement Learning for Efficient Coverage Path Planning

SDRL: Interpretable and Data-efficient Deep Reinforcement Learning Leveraging Symbolic Planning

Incremental Learning of Procedural Planning Knowledge in Challenging Environments