LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs

Yunsheng Ma,Can Cui,Xu Cao,Wenqian Ye,Peiran Liu,Juanwu Lu,Amr Abdelraouf,Rohit Gupta,Kyungtae Han,Aniket Bera,James M. Rehg,Ziran Wang

2024-04-04

Abstract:Autonomous driving (AD) has made significant strides in recent years. However, existing frameworks struggle to interpret and execute spontaneous user instructions, such as "overtake the car ahead." Large Language Models (LLMs) have demonstrated impressive reasoning capabilities showing potential to bridge this gap. In this paper, we present LaMPilot, a novel framework that integrates LLMs into AD systems, enabling them to follow user instructions by generating code that leverages established functional primitives. We also introduce LaMPilot-Bench, the first benchmark dataset specifically designed to quantitatively evaluate the efficacy of language model programs in AD. Adopting the LaMPilot framework, we conduct extensive experiments to assess the performance of off-the-shelf LLMs on LaMPilot-Bench. Our results demonstrate the potential of LLMs in handling diverse driving scenarios and following user instructions in driving. To facilitate further research in this area, we release our code and data at <a class="link-external link-https" href="https://github.com/PurdueDigitalTwin/LaMPilot" rel="external noopener nofollow">this https URL</a>.

Computation and Language,Artificial Intelligence

What problem does this paper attempt to address?

This paper focuses on how to enable autonomous driving systems to understand and execute user instructions in natural language. Current autonomous driving frameworks face difficulties in handling such unstructured user instructions. The paper proposes a new framework called LaMPilot, which combines large language models (LLMs) to generate code that translates natural language instructions into executable driving plans using existing functional primitives. LaMPilot-Bench is a newly introduced benchmark dataset specifically designed to quantitatively evaluate the performance of LLMs in autonomous driving tasks. This dataset includes a series of tasks described in natural language and a simulated environment for comprehensive evaluation of agent strategy performance. Through the LaMPilot framework, researchers conducted extensive experiments with existing LLMs, and the results suggest the potential of LLMs in handling various driving scenarios and following driving instructions. Additionally, they proposed a baseline method based on human feedback, which integrates human guidance into the decision-making process of LLMs to improve their performance. Overall, this paper aims to address how autonomous driving systems can better understand and respond to user instructions in natural language. By integrating LLMs with traditional autonomous driving algorithms, the flexibility and interpretability of the system are improved.

LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs

Large Language Models for Autonomous Driving (LLM4AD): Concept, Benchmark, Simulation, and Real-Vehicle Experiment

LMDrive: Closed-Loop End-to-End Driving with Large Language Models

Drive Like a Human: Rethinking Autonomous Driving with Large Language Models

DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving

LLM4Drive: A Survey of Large Language Models for Autonomous Driving

LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving

Driving Everywhere with Large Language Model Policy Adaptation

Empowering Autonomous Driving with Large Language Models: A Safety Perspective

Personalized Autonomous Driving with Large Language Models: Field Experiments

Large Language Models for Human-like Autonomous Driving: A Survey

Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving

MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene Understanding

DriveMLLM: A Benchmark for Spatial Understanding with Multimodal Large Language Models in Autonomous Driving

Facilitating Autonomous Driving Tasks with Large Language Models

DriveGPT4: Interpretable End-to-end Autonomous Driving via Large Language Model

DriveLLM: Charting the Path Toward Full Autonomous Driving with Large Language Models

SurrealDriver: Designing LLM-powered Generative Driver Agent Framework based on Human Drivers' Driving-thinking Data

Prompting Multi-Modal Tokens to Enhance End-to-End Autonomous Driving Imitation Learning with LLMs

A Language Agent for Autonomous Driving

A Survey on Large Language Model-empowered Autonomous Driving