Receive, Reason, and React: Drive as You Say with Large Language Models in Autonomous Vehicles

Can Cui,Yunsheng Ma,Xu Cao,Wenqian Ye,Ziran Wang

DOI: https://doi.org/10.48550/arXiv.2310.08034

2023-10-12

Abstract:The fusion of human-centric design and artificial intelligence (AI) capabilities has opened up new possibilities for next-generation autonomous vehicles that go beyond transportation. These vehicles can dynamically interact with passengers and adapt to their preferences. This paper proposes a novel framework that leverages Large Language Models (LLMs) to enhance the decision-making process in autonomous vehicles. By utilizing LLMs' linguistic and contextual understanding abilities with specialized tools, we aim to integrate the language and reasoning capabilities of LLMs into autonomous vehicles. Our research includes experiments in HighwayEnv, a collection of environments for autonomous driving and tactical decision-making tasks, to explore LLMs' interpretation, interaction, and reasoning in various scenarios. We also examine real-time personalization, demonstrating how LLMs can influence driving behaviors based on verbal commands. Our empirical results highlight the substantial advantages of utilizing chain-of-thought prompting, leading to improved driving decisions, and showing the potential for LLMs to enhance personalized driving experiences through ongoing verbal feedback. The proposed framework aims to transform autonomous vehicle operations, offering personalized support, transparent decision-making, and continuous learning to enhance safety and effectiveness. We achieve user-centric, transparent, and adaptive autonomous driving ecosystems supported by the integration of LLMs into autonomous vehicles.

Human-Computer Interaction,Artificial Intelligence,Robotics

What problem does this paper attempt to address?

The paper attempts to address the issue of exploring how to integrate large language models (LLMs) into autonomous vehicles to enhance the vehicle's decision-making capabilities, interactivity, and personalization levels. Specifically, the research aims to: 1. **Enhance Decision-Making Capabilities**: Improve the decision-making process of autonomous vehicles in various scenarios by leveraging the language understanding and reasoning abilities of LLMs. 2. **Achieve Natural Language Interaction**: Enable autonomous vehicles to interact with drivers through natural language, facilitating more intuitive and human-like communication. 3. **Context Understanding and Reasoning**: Enhance the vehicle's understanding of information such as traffic regulations and accident reports, ensuring that decisions prioritize safety and compliance with regulations. 4. **Zero-Shot Planning**: Enable the vehicle to understand and handle unfamiliar scenarios without prior experience. 5. **Continuous Learning and Personalization**: Continuously learn to adapt to the driver's preferences and improve the driving experience over time. 6. **Transparency and Trust Building**: Allow LLMs to explain their decision-making process in simple language, thereby enhancing user trust in the technology. Through these efforts, researchers hope to develop a human-centered autonomous driving ecosystem where LLMs act as the "brain," while perception modules, positioning systems, and other onboard devices serve as the "eyes" and "hands," working together to accomplish complex driving tasks.

Receive, Reason, and React: Drive as You Say with Large Language Models in Autonomous Vehicles

Receive, Reason, and React: Drive as You Say, With Large Language Models in Autonomous Vehicles

Drive as You Speak: Enabling Human-Like Interaction with Large Language Models in Autonomous Vehicles

Personalized Autonomous Driving with Large Language Models: Field Experiments

Large Language Models for Autonomous Driving (LLM4AD): Concept, Benchmark, Simulation, and Real-Vehicle Experiment

LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving

Drive Like a Human: Rethinking Autonomous Driving with Large Language Models

A Language Agent for Autonomous Driving

Empowering Autonomous Driving with Large Language Models: A Safety Perspective

Human-Centric Autonomous Systems with LLMs for User Command Reasoning

Towards Interactive and Learnable Cooperative Driving Automation: a Large Language Model-Driven Decision-Making Framework

Towards Proactive Interactions for In-Vehicle Conversational Assistants Utilizing Large Language Models

DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving

LLM4Drive: A Survey of Large Language Models for Autonomous Driving

On-Board Vision-Language Models for Personalized Autonomous Vehicle Motion Control: System Design and Real-World Validation

Hybrid Reasoning Based on Large Language Models for Autonomous Car Driving

Large Language Models for Human-like Autonomous Driving: A Survey

A Survey on Multimodal Large Language Models for Autonomous Driving

Evaluation of Large Language Models for Decision Making in Autonomous Driving

LMDrive: Closed-Loop End-to-End Driving with Large Language Models

AgentsCoDriver: Large Language Model Empowered Collaborative Driving with Lifelong Learning