Large Language Models as Planning Domain Generators

James Oswald, Kavitha Srinivas, Harsha Kokel, Junkyu Lee, Michael Katz, Shirin Sohrabi
DOI: https://doi.org/10.48550/arXiv.2405.06650
2024-04-03
Abstract: Developing domain models is one of the few remaining places that require manual human labor in AI planning. Thus, in order to make planning more accessible, it is desirable to automate the process of domain model generation. To this end, we investigate if large language models (LLMs) can be used to generate planning domain models from simple textual descriptions. Specifically, we introduce a framework for automated evaluation of LLM-generated domains by comparing the sets of plans for domain instances. Finally, we perform an empirical analysis of 7 large language models, including coding and chat models across 9 different planning domains, and under three classes of natural language domain descriptions. Our results indicate that LLMs, particularly those with high parameter counts, exhibit a moderate level of proficiency in generating correct planning domains from natural language descriptions. Our code is available at
Computation and Language, Artificial Intelligence
What problem does this paper attempt to address?
The paper investigates whether large language models (LLMs) can automatically generate planning domain models from simple textual descriptions, which would reduce the manual modeling effort in AI planning. It introduces a framework that evaluates LLM-generated domains automatically by comparing the sets of plans for domain instances, and it analyzes 7 LLMs (coding and chat models) across 9 planning domains under three classes of natural language domain descriptions. The results indicate that LLMs, particularly those with high parameter counts, show a moderate level of proficiency in generating correct planning domains from natural language descriptions.
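To make the idea of comparing sets of plans concrete, here is a minimal Python sketch. It is an illustration only, not the paper's metric: the abstract does not say how the plan sets are scored, so the Jaccard overlap used here, the function name `plan_set_overlap`, and the toy action strings are all assumptions. The sketch also assumes the plan sets have already been produced by some planner for the same problem instance.

```python
from typing import Iterable, Tuple

# A plan is modeled as an ordered tuple of ground action strings (an assumption).
Plan = Tuple[str, ...]


def plan_set_overlap(reference_plans: Iterable[Plan],
                     generated_plans: Iterable[Plan]) -> float:
    """Jaccard overlap between two plan sets.

    Hypothetical scoring choice for illustration; the paper's actual
    comparison may differ.
    """
    ref = frozenset(reference_plans)
    gen = frozenset(generated_plans)
    if not ref and not gen:
        # Two empty plan sets are treated as identical.
        return 1.0
    return len(ref & gen) / len(ref | gen)


# Toy plan sets for one instance of a blocks-style domain (made-up data).
reference = {
    ("pick a", "stack a b"),
    ("pick b", "stack b a"),
}
generated = {
    ("pick a", "stack a b"),
}

score = plan_set_overlap(reference, generated)
```

A score of 1.0 would mean the generated domain admits exactly the same plans as the reference domain on this instance, while lower values mean plans are missing or spurious plans are admitted.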