Abstract:Integrated task and motion planning (TAMP) is desirable for generalized autonomy robots but it is challenging at the same time. TAMP requires the planner to not only search in both the large symbolic task space and the high-dimension motion space but also deal with the infeasible task actions due to its intrinsic hierarchical process. We propose a novel decision-making framework for TAMP by constructing an extended decision tree for both symbolic task planning and high-dimension motion variable binding. We integrate top-k planning for generating explicitly a skeleton space where a variety of candidate skeleton plans are at disposal. Moreover, we effectively combine this skeleton space with the resultant motion variable spaces into a single extended decision space. Accordingly, we use Monte-Carlo Tree Search (MCTS) to ensure an exploration-exploitation balance at each decision node and optimize globally to produce optimal solutions. The proposed seamless combination of symbolic top-k planning with streams, with the proved optimality of MCTS, leads to a powerful planning algorithm that can handle the combinatorial complexity of long-horizon manipulation tasks. We empirically evaluate our proposed algorithm in challenging robot tasks with different domains that require multi-stage decisions and show how our method can overcome the large task space and motion space through its effective tree search compared to its most competitive baseline method.
What problem does this paper attempt to address?
The paper attempts to address several key challenges in Task and Motion Planning (TAMP), specifically including:
1. **Large - scale task space**: One characteristic of TAMP problems is the existence of a large abstract state space that contains additional state variables without geometric meaning, such as whether the food has been washed in a cooking task. To make this large action space tractable, the research community usually adopts relational representation methods (such as PDDL) to compactly describe the planning domain, taking advantage of its underlying regularity.
2. **Large - scale motion space**: The skeleton must concretize its symbolic actions through specific motion values (bindings) so that they can be executed by physical robots. Therefore, low - level motion planning involving high - dimensional variables is ultimately inevitable. The motion space depends on the parameterization of abstract actions and the length of the skeleton. For example, in Figure 1b, we search for the position (x1, y1) of body1 and its orientation (w1), and in skeleton 2, the dimension of the motion space doubles to ⟨x1, y1, w1, x2, y2, w2⟩∈R^6. Due to the larger motion space in robot manipulation, an efficient motion planner is particularly important.
3. **Infeasible task skeletons**: The state abstraction in TAMP inevitably prevents the task planner from accessing detailed environmental features that are crucial to the problem solution. For example, in Figure 1a, the task planner does not receive information about the obstruction by taller objects from the symbolic initial state, so it may generate skeleton 1 in the first attempt. In AI planning research, this problem is usually described as an incomplete domain description, that is, the task planner does not have enough domain knowledge to generate completely correct results. To recover from such dead - ends, the task planner in TAMP must provide alternative skeletons and report the optimal selection skeleton for further inspection based on the binding search of motion planning.
To address the above challenges, the paper proposes an Extended Tree Search Method (eTAMP), which can solve these problems within a single framework with a simple pipeline. The main contributions of eTAMP are:
- Proposing a top - k skeleton planner based on the general top - k algorithm, which can generate diverse high - level plans in large - scale task spaces.
- Proposing an Extended Decision Tree, which models skeleton selection under infeasible task skeletons and binding search in large - scale motion spaces in the same optimal decision - making process.
The paper also demonstrates the ability of eTAMP to find feasible plans at various levels of difficulty by evaluating the proposed algorithm in challenging robot tasks in different domains.