Deployment of Large Language Models to Control Mobile Robots at the Edge

Pascal Sikorski,Leendert Schrader,Kaleb Yu,Lucy Billadeau,Jinka Meenakshi,Naveena Mutharasan,Flavio Esposito,Hadi AliAkbarpour,Madi Babaiasl
2024-10-10
Abstract:This paper investigates the possibility of intuitive human-robot interaction through the application of Natural Language Processing (NLP) and Large Language Models (LLMs) in mobile robotics. This work aims to explore the feasibility of using these technologies for edge-based deployment, where traditional cloud dependencies are eliminated. The study specifically contrasts the performance of GPT-4-Turbo, which requires cloud connectivity, with an offline-capable, quantized version of LLaMA 2 (LLaMA 2-7B.Q5 K M). These results show that GPT-4-Turbo delivers superior performance in interpreting and executing complex commands accurately, whereas LLaMA 2 exhibits significant limitations in consistency and reliability of command execution. Communication between the control computer and the mobile robot is established via a Raspberry Pi Pico W, which wirelessly receives commands from the computer without internet dependency and transmits them through a wired connection to the robot's Arduino controller. This study highlights the potential and challenges of implementing LLMs and NLP at the edge, providing groundwork for future research into fully autonomous and network-independent robotic systems. For video demonstrations and source code, please refer to: <a class="link-external link-https" href="https://tinyurl.com/MobileRobotGPT4LLaMA2024" rel="external noopener nofollow">this https URL</a>.
Robotics
What problem does this paper attempt to address?
The paper attempts to address the issue of achieving intuitive human-robot interaction in mobile robot control, particularly in scenarios requiring robust privacy measures, high-risk decision-making, and operations in disaster or remote areas, thereby reducing reliance on cloud computing. Specifically, the main objectives of the study include: 1. **Feasibility Study**: Exploring the feasibility of applying large-scale language models (LLMs) and natural language processing (NLP) technologies to mobile robots to achieve intuitive human-robot interaction. 2. **Performance Evaluation**: Comparing the performance of the online model GPT-4-Turbo and the offline version of LLaMA 2 in practical applications, especially in scenarios lacking continuous cloud connectivity. 3. **Promoting Self-Sufficient Systems**: Deploying these models on edge devices to enable mobile robots to operate autonomously without remote server support, thereby enhancing data security and reducing latency. The focus of the research is on how to leverage advanced language models to enhance the robots' ability to understand and execute complex instructions while ensuring privacy and real-time performance, ultimately driving mobile robot systems towards full autonomy and network independence.