Abstract:Goal instructions for autonomous AI agents cannot assume that objects have unique names. Instead, objects in goals must be referred to by providing suitable descriptions. However, this raises problems in both classical planning and generalized planning. The standard approach to handling existentially quantified goals in classical planning involves compiling them into a DNF formula that encodes all possible variable bindings and adding dummy actions to map each DNF term into the new, dummy goal. This preprocessing is exponential in the number of variables. In generalized planning, the problem is different: even if general policies can deal with any initial situation and goal, executing a general policy requires the goal to be grounded to define a value for the policy features. The problem of grounding goals, namely finding the objects to bind the goal variables, is subtle: it is a generalization of classical planning, which is a special case when there are no goal variables to bind, and constraint reasoning, which is a special case when there are no actions. In this work, we address the goal grounding problem with a novel supervised learning approach. A GNN architecture, trained to predict the cost of partially quantified goals over small domain instances is tested on larger instances involving more objects and different quantified goals. The proposed architecture is evaluated experimentally over several planning domains where generalization is tested along several dimensions including the number of goal variables and objects that can bind such variables. The scope of the approach is also discussed in light of the known relationship between GNNs and C2 logics.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to find appropriate instantiation objects for existentially quantified goals, thereby achieving the "grounding" of the goals. Specifically, the paper focuses on dealing with goal instructions that do not rely on unique object names in classical planning and generalized planning. This type of problem is very important in practical applications because when autonomous AI agents perform tasks, they usually cannot assume that all objects have unique names and need to refer to objects through descriptions.
### Problem Background
1. **Challenges in Classical Planning**:
- In classical planning, goals are usually described by the unique names of objects. For example, "Put block A on block B", where A and B are specific blocks.
- However, when goals use existential quantifiers (such as "Put the large yellow ball next to the blue bag"), these goals cannot be directly represented by unique names but need to be qualified by descriptions.
2. **Challenges in Generalized Planning**:
- Generalized planning aims to handle different numbers of objects and goals, so goals need to be instantiated to define the values of strategy features.
- When executing a generalized strategy, goals must be instantiated, that is, specific objects need to be found to bind the goal variables.
3. **Computational Complexity**:
- The standard method for dealing with existentially quantified goals is to convert them into disjunctive normal form (DNF), which leads to exponential computational complexity.
- Even in the absence of actions, determining whether an existentially quantified goal is true is an NP - hard problem.
### The Solution of the Paper
The paper proposes a supervised - learning - based method, using a graph neural network (GNN) architecture to predict the cost of partially quantified goals, and verifies the generalization ability of this method in multiple planning domains through experiments. The specific steps are as follows:
1. **Training Stage**:
- Use small - scale instances to train the GNN model to predict the cost of partially quantified goals.
- The training data include the initial state and the goal, as well as their corresponding optimal costs.
2. **Testing Stage**:
- Test the model in larger - scale instances and evaluate its generalization ability in different dimensions, including the number of goal variables and the number of objects.
3. **Gradually Instantiating Goal Variables**:
- Instantiate goal variables one by one through a greedy algorithm, and choose the binding method that minimizes the predicted cost.
- Finally, obtain a fully instantiated goal and use a standard classical or generalized planner to generate a plan.
### Formulas and Expressions
The key formulas involved in the paper are as follows:
- Representation of Existentially Quantified Goals:
\[
\exists x, y: \text{BALL}(x) \wedge \text{LARGE}(x) \wedge \text{YELLOW}(x) \wedge \text{PACKAGE}(y) \wedge \text{BLUE}(y) \wedge \text{NEXT}(x, y)
\]
- Value Function for Goal Instantiation:
\[
V^*(s; G) = \min_{d \in D, C \in C} d \cdot [[D_{d,C}]] + N \cdot [[\neg D_{d,C}]]
\]
where \( D_{d,C} \) represents the distance \( d \) at which the robot reaches the nearest cell of color \( C \), and \( N \) is the maximum distance plus 1.
### Summary
By introducing the GNN architecture, the paper solves the instantiation problem of existentially quantified goals, providing an efficient and scalable method that can perform goal instantiation in planning problems of different scales and complexities. This method not only improves planning efficiency but also provides new ideas for goal processing in generalized planning.