Abstract:Purpose Autonomous robots must be able to understand long-term manipulation tasks described by humans and perform task analysis and planning based on the current environment in a variety of scenes, such as daily manipulation and industrial assembly. However, both classical task and motion planning algorithms and single data-driven learning planning methods have limitations in practicability, generalization and interpretability. The purpose of this work is to overcome the limitations of the above methods and achieve generalized and explicable long-term robot manipulation task planning. Design/methodology/approach The authors propose a planning method for long-term manipulation tasks that combines the advantages of existing methods and the prior cognition brought by the knowledge graph. This method integrates visual semantic understanding based on scene graph generation, regression planning based on deep learning and multi-level representation and updating based on a knowledge base. Findings The authors evaluated the capability of this method in a kitchen cooking task and tabletop arrangement task in simulation and real-world environments. Experimental results show that the proposed method has a significantly improved success rate compared with the baselines and has excellent generalization performance for new tasks. Originality/value The authors demonstrate that their method is scalable to long-term manipulation tasks with varying complexity and visibility. This advantage allows their method to perform better in new manipulation tasks. The planning method proposed in this work is meaningful for the present robot manipulation task and can be intuitive for similar high-level robot planning.

Substructure-Preserving Generalization for On-Site Robotic Construction Planning

Robot-Enabled Construction Assembly with Automated Sequence Planning based on ChatGPT: RoboGPT

Learning Parameterized Task Structure for Generalization to Unseen Entities

Transferring Foundation Models for Generalizable Robotic Manipulation

Pave the Way to Grasp Anything: Transferring Foundation Models for Universal Pick-Place Robots

Rearrangement Planning for General Part Assembly

Structural Concept Learning via Graph Attention for Multi-Level Rearrangement Planning

Long-term Robot Manipulation Task Planning with Scene Graph and Semantic Knowledge

Learning to Design and Construct Bridge Without Blueprint

Deep Reinforcement Learning-based Task Assignment and Path Planning for Multi-agent Construction Robots

Sketch-Plan-Generalize: Continual Few-Shot Learning of Inductively Generalizable Spatial Concepts

A Formal Framework for Robot Construction Problems: A Hybrid Planning Approach

SayPlan: Grounding Large Language Models using 3D Scene Graphs for Scalable Robot Task Planning

Generalization of Compositional Tasks with Logical Specification via Implicit Planning

Long-Horizon Multi-Robot Rearrangement Planning for Construction Assembly

Robot Active Neural Sensing and Planning in Unknown Cluttered Environments

GRID: Scene-Graph-based Instruction-driven Robotic Task Planning

From Abstractions to Grounded Languages for Robust Coordination of Task Planning Robots

A Framework to Co-Optimize Robot Exploration and Task Planning in Unknown Environments

Object-Aware View Planning for Autonomous 3-D Model Reconstruction of Buildings Using a Mobile Robot

An Enhanced Hierarchical Planning Framework for Multi-Robot Autonomous Exploration