GPT-in-the-Loop: Adaptive Decision-Making for Multiagent Systems

Nathalia Nascimento, Paulo Alencar, Donald Cowan
2023-08-21
Abstract: This paper introduces the "GPT-in-the-loop" approach, a novel method combining the advanced reasoning capabilities of Large Language Models (LLMs) like Generative Pre-trained Transformers (GPT) with multiagent systems (MAS). Venturing beyond traditional adaptive approaches that generally require long training processes, our framework employs GPT-4 for enhanced problem-solving and explanation skills. Our experimental backdrop is the smart streetlight Internet of Things (IoT) application. Here, agents use sensors, actuators, and neural networks to create an energy-efficient lighting system. By integrating GPT-4, these agents achieve superior decision-making and adaptability without the need for extensive training. We compare this approach with both traditional neuroevolutionary methods and solutions provided by software engineers, underlining the potential of GPT-driven multiagent systems in IoT. Structurally, the paper outlines the incorporation of GPT into the agent-driven Framework for the Internet of Things (FIoT), introduces our proposed GPT-in-the-loop approach, presents comparative results in the IoT context, and concludes with insights and future directions.
Multiagent Systems, Artificial Intelligence, Neural and Evolutionary Computing, Software Engineering
What problem does this paper attempt to address?
This paper asks how the advanced reasoning capabilities of large language models such as GPT-4 can be used in multiagent systems (MAS) to improve decision-making and adaptability without long training processes. It proposes the "GPT-in-the-loop" approach, which integrates GPT-4 into a multiagent system, and applies it to a smart streetlight Internet of Things (IoT) scenario. The aim is for the streetlight agents to make better decisions and adapt quickly to changing environments without the extended training that traditional adaptive approaches require. The paper also explores the method's potential in other application scenarios, such as food-foraging tasks in evolutionary robotics.
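
To make the idea concrete, the following is a minimal sketch of what placing a language model "in the loop" of a streetlight agent's decision step could look like. It is not the authors' implementation: the abstract does not specify how FIoT agents exchange information with GPT-4, so every name here (Observation, build_prompt, parse_action, stub_advisor, decide), the prompt wording, and the ON/DIM/OFF action set are assumptions made for illustration. The model is replaced by a rule-based stub so that the example runs without any external service.

```python
import re
from dataclasses import dataclass
from typing import Callable

# Hypothetical sensor reading for a single streetlight agent.
@dataclass
class Observation:
    ambient_light: float    # 0.0 (dark) to 1.0 (bright); assumed scale
    person_detected: bool

# An advisor maps a text prompt to a text reply. In the paper this role
# is played by GPT-4; the interface used here is an assumption.
Advisor = Callable[[str], str]

VALID_ACTIONS = {"ON", "DIM", "OFF"}  # assumed action set

def build_prompt(obs: Observation) -> str:
    """Describe the agent's current readings in plain text."""
    return (
        "You control a streetlight. "
        f"ambient_light={obs.ambient_light:.2f}, "
        f"person_detected={obs.person_detected}. "
        "Reply with exactly one word: ON, DIM, or OFF."
    )

def parse_action(reply: str) -> str:
    """Normalize the reply; fall back to OFF if it is not a known action."""
    word = reply.strip().upper()
    return word if word in VALID_ACTIONS else "OFF"

def stub_advisor(prompt: str) -> str:
    """Rule-based stand-in for the language model (illustration only)."""
    match = re.search(r"ambient_light=([0-9.]+)", prompt)
    light = float(match.group(1)) if match else 1.0
    if light >= 0.5:
        return "OFF"    # bright enough, no lighting needed
    if "person_detected=True" in prompt:
        return "ON"     # dark and someone is nearby
    return "DIM"        # dark but nobody nearby

def decide(obs: Observation, advisor: Advisor) -> str:
    """One decision step: sense, ask the advisor, act on the parsed reply."""
    return parse_action(advisor(build_prompt(obs)))

if __name__ == "__main__":
    for obs in (Observation(0.1, True), Observation(0.1, False), Observation(0.9, False)):
        print(obs, "->", decide(obs, stub_advisor))
```

Because the advisor is passed in as a plain function, the stub could be swapped for a call to a real model without changing the agent loop. The fallback to OFF on an unrecognized reply is an arbitrary choice for this sketch; a deployed system would need to choose its default deliberately.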