Task Planning for Object Rearrangement in Multi-room Environments

Karan Mirakhor,Sourav Ghosh,Dipanjan Das,Brojeshwar Bhowmick

2024-06-01

Abstract:Object rearrangement in a multi-room setup should produce a reasonable plan that reduces the agent's overall travel and the number of steps. Recent state-of-the-art methods fail to produce such plans because they rely on explicit exploration for discovering unseen objects due to partial observability and a heuristic planner to sequence the actions for rearrangement. This paper proposes a novel hierarchical task planner to efficiently plan a sequence of actions to discover unseen objects and rearrange misplaced objects within an untidy house to achieve a desired tidy state. The proposed method introduces several novel techniques, including (i) a method for discovering unseen objects using commonsense knowledge from large language models, (ii) a collision resolution and buffer prediction method based on Cross-Entropy Method to handle blocked goal and swap cases, (iii) a directed spatial graph-based state space for scalability, and (iv) deep reinforcement learning (RL) for producing an efficient planner. The planner interleaves the discovery of unseen objects and rearrangement to minimize the number of steps taken and overall traversal of the agent. The paper also presents new metrics and a benchmark dataset called MoPOR to evaluate the effectiveness of the rearrangement planning in a multi-room setting. The experimental results demonstrate that the proposed method effectively addresses the multi-room rearrangement problem.

Robotics

What problem does this paper attempt to address?

### The Problem Addressed by This Paper This paper primarily addresses the task planning problem of object rearrangement in multi-room environments. Specifically: 1. **Efficient Planning under Partial Observability**: - The paper proposes a novel hierarchical task planning method to tackle the object rearrangement problem under partially observable conditions in multi-room environments. This method aims to improve task efficiency by minimizing the agent's movement steps and total travel distance. 2. **Unseen Object Discovery**: - A method for unseen object discovery based on large language models (LLMs) is proposed, utilizing common-sense knowledge to predict the possible rooms and container locations of unseen objects. 3. **Collision Resolution and Buffer Management**: - A collision resolution strategy based on the Cross-Entropy Method is introduced to handle object collisions and situations where target positions are blocked or need to be swapped. 4. **Deep Reinforcement Learning Planner**: - A Deep Reinforcement Learning (Deep RL) planner is used to generate effective action sequences for simultaneously searching for unseen objects and rearranging seen objects, thereby optimizing the agent's movement path and the number of steps. 5. **Benchmark Dataset**: - To evaluate the effectiveness of the method, a new benchmark dataset, MoPOR, is introduced for testing task planners in multi-room environments, overcoming the limitations of existing datasets. In summary, the main goal of this paper is to achieve efficient object rearrangement tasks in multi-room environments under partially observable conditions through effective planning strategies, while reducing the agent's travel distance and number of steps.

Task Planning for Object Rearrangement in Multi-room Environments

MANER: Multi-Agent Neural Rearrangement Planning of Objects in Cluttered Environments

Hierarchical Object-Oriented POMDP Planning for Object Rearrangement

Effectively Rearranging Heterogeneous Objects on Cluttered Tabletops

Object Rearrangement with Nested Nonprehensile Manipulation Actions

Monte-Carlo Tree Search for Efficient Visually Guided Rearrangement Planning

Multi-Stage Monte Carlo Tree Search for Non-Monotone Object Rearrangement Planning in Narrow Confined Environments

Visual Room Rearrangement

Rearrangement Planning via Heuristic Search

Rearrange Indoor Scenes for Human-Robot Co-Activity

Integrating Active Sensing and Rearrangement Planning for Efficient Object Retrieval from Unknown, Confined, Cluttered Environments

Toward Efficient Task Planning for Dual-Arm Tabletop Object Rearrangement

Language-guided Active Sensing of Confined, Cluttered Environments via Object Rearrangement Planning

Neural Rearrangement Planning for Object Retrieval from Confined Spaces Perceivable by Robot's In-hand RGB-D Sensor

Physical Reasoning and Object Planning for Household Embodied Agents

Object-Centric Kinodynamic Planning for Nonprehensile Robot Rearrangement Manipulation

Synchronized Multi-Arm Rearrangement Guided by Mode Graphs with Capacity Constraints

LLM-enhanced Scene Graph Learning for Household Rearrangement

Task Allocation for Multi-Robot Task and Motion Planning: a case for Object Picking in Cluttered Workspaces

Hierarchical Policy for Non-prehensile Multi-object Rearrangement with Deep Reinforcement Learning and Monte Carlo Tree Search