Traffic Scene Generation from Natural Language Description for Autonomous Vehicles with Large Language Model

Bo-Kai Ruan, Hao-Tang Tsui, Yung-Hui Li, Hong-Han Shuai

2024-09-15

Abstract: Text-to-scene generation, transforming textual descriptions into detailed scenes, typically relies on generating key scenarios along predetermined paths, constraining environmental diversity and limiting customization flexibility. To address these limitations, we propose a novel text-to-traffic scene framework that leverages a large language model to generate diverse traffic scenarios within the CARLA simulator based on natural language descriptions. Users can define specific parameters such as weather conditions, vehicle types, and road signals, while our pipeline can autonomously select the starting point and scenario details, generating scenes from scratch without relying on predetermined locations or trajectories. Furthermore, our framework supports both critical and routine traffic scenarios, enhancing its applicability. Experimental results indicate that our approach promotes diverse agent planning and road selection, enhancing the training of autonomous agents in traffic environments. Notably, our methodology has achieved a 16% reduction in average collision rates. Our work is made publicly available at https://basiclab.github.io/TTSG.

Robotics

What problem does this paper attempt to address?

This paper addresses the limited ability of existing methods to generate traffic scenes for autonomous vehicles from natural-language descriptions. Existing methods usually rely on predefined paths and locations, which restricts both the diversity of the environment and the flexibility of customization. To overcome these limitations, the authors propose a new text-to-traffic-scene generation framework based on a large language model (LLM). The framework generates diverse traffic scenes in the CARLA simulator from natural-language descriptions. Users can define specific parameters such as weather conditions, vehicle types, and road signals, and the system selects starting points and scene details on its own, without relying on predefined locations or trajectories. The framework supports both critical and routine traffic scenes, which broadens its applicability. The experimental results indicate that the method promotes diverse agent planning and road selection and improves the training of autonomous agents in traffic environments, most notably with a 16% reduction in the average collision rate.
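To make the idea of "user-defined parameters plus automatic start-point selection" concrete, here is a minimal sketch. It is not the authors' pipeline and does not call the CARLA API: the `SceneSpec` and `RoadCandidate` classes, the `select_start` function, and the signal names are all hypothetical, and the structured spec is assumed to have already been extracted from the text description by an LLM.

```python
import random
from dataclasses import dataclass, field


@dataclass
class SceneSpec:
    """Hypothetical structured form of a parsed scene description."""
    weather: str = "clear"
    vehicle_types: list[str] = field(default_factory=list)
    road_signals: list[str] = field(default_factory=list)


@dataclass(frozen=True)
class RoadCandidate:
    """Hypothetical map location and the road signals found near it."""
    road_id: int
    signals: frozenset[str]


def select_start(spec, candidates, rng):
    """Pick a start location among candidates that have every requested signal.

    No location is fixed in advance: any candidate that satisfies the
    description is eligible, and one is drawn at random.
    """
    required = set(spec.road_signals)
    matching = [c for c in candidates if required <= c.signals]
    if not matching:
        raise ValueError("no road satisfies the requested signals")
    return rng.choice(matching)


# Toy map data (invented for illustration only).
candidates = [
    RoadCandidate(1, frozenset({"traffic_light"})),
    RoadCandidate(2, frozenset({"stop_sign"})),
    RoadCandidate(3, frozenset({"traffic_light", "stop_sign"})),
]

# Assumed output of the language-model parsing step for a description such as
# "a truck approaches a stop sign in the rain".
spec = SceneSpec(weather="rain", vehicle_types=["truck"], road_signals=["stop_sign"])

start = select_start(spec, candidates, random.Random(0))
print(start.road_id)  # one of the roads that has a stop sign
```

Drawing the start location from all matching candidates, rather than from a fixed list of positions, is what lets repeated runs of the same description land on different roads.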

From Time to Space: Automatic Annotation of Unmarked Traffic Scene Based on Trajectory Data

TrafficGen: Learning to Generate Diverse and Realistic Traffic Scenarios

Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner

RealGen: Retrieval Augmented Generation for Controllable Traffic Scenarios

ChatScene: Knowledge-Enabled Safety-Critical Scenario Generation for Autonomous Vehicles

Language-Driven Interactive Traffic Trajectory Generation

DragTraffic: Interactive and Controllable Traffic Scene Generation for Autonomous Driving

ScenarioNet: Open-Source Platform for Large-Scale Traffic Scenario Simulation and Modeling

Trajeglish: Traffic Modeling as Next-Token Prediction

Text-to-Drive: Diverse Driving Behavior Synthesis via Large Language Models

Scenario-Based Curriculum Generation for Multi-Agent Autonomous Driving

Traffic Scenario Logic: A Spatial-Temporal Logic for Modeling and Reasoning of Urban Traffic Scenarios

Structured Scene Generation for Autonomous Driving Simulation and Digital Twin Scheduling

Multi-Vehicle Interaction Scenarios Generation with Interpretable Traffic Primitives and Gaussian Process Regression

SurrealDriver: Designing Generative Driver Agent Simulation Framework in Urban Contexts based on Large Language Model

TrafficMCTS: A Closed-Loop Traffic Flow Generation Framework with Group-Based Monte Carlo Tree Search

TrafficGPT: Towards Multi-Scale Traffic Analysis and Generation with Spatial-Temporal Agent Framework

Scenario Diffusion: Controllable Driving Scenario Generation With Diffusion

Automating Traffic Model Enhancement with AI Research Agent

Scalable Traffic Simulation for Autonomous Driving Via Multi-Agent Goal Assignment and Autoregressive Goal-Directed Planning