CarDreamer: Open-Source Learning Platform for World Model based Autonomous Driving

Dechen Gao, Shuangyu Cai, Hanchu Zhou, Hang Wang, Iman Soltani, Junshan Zhang
2024-07-26
Abstract: To safely navigate intricate real-world scenarios, autonomous vehicles must be able to adapt to diverse road conditions and anticipate future events. World model (WM) based reinforcement learning (RL) has emerged as a promising approach by learning and predicting the complex dynamics of various environments. Nevertheless, to the best of our knowledge, there does not exist an accessible platform for training and testing such algorithms in sophisticated driving environments. To fill this void, we introduce CarDreamer, the first open-source learning platform designed specifically for developing WM based autonomous driving algorithms. It comprises three key components: 1) World model backbone: CarDreamer has integrated some state-of-the-art WMs, which simplifies the reproduction of RL algorithms. The backbone is decoupled from the rest and communicates using the standard Gym interface, so that users can easily integrate and test their own algorithms. 2) Built-in tasks: CarDreamer offers a comprehensive set of highly configurable driving tasks which are compatible with Gym interfaces and are equipped with empirically optimized reward functions. 3) Task development suite: This suite streamlines the creation of driving tasks, enabling easy definition of traffic flows and vehicle routes, along with automatic collection of multi-modal observation data. A visualization server allows users to trace real-time agent driving videos and performance metrics through a browser. Furthermore, we conduct extensive experiments using built-in tasks to evaluate the performance and potential of WMs in autonomous driving. Thanks to the richness and flexibility of CarDreamer, we also systematically study the impact of observation modality, observability, and sharing of vehicle intentions on AV safety and efficiency.
All code and documents are accessible at https://github.com/ucd-dare/CarDreamer.
Robotics, Artificial Intelligence
What problem does this paper attempt to address?
The paper targets the lack of an accessible platform for training and testing world model (WM) based reinforcement learning (RL) algorithms for autonomous vehicles (AVs), which must adapt to diverse road conditions and anticipate future events. It introduces CarDreamer, which the authors describe as the first open-source learning platform dedicated to developing and evaluating WM based autonomous driving algorithms. The paper addresses the following key issues:

1. **Insufficiency of Existing Platforms**: To the best of the authors' knowledge, no accessible platform exists for training and testing WM based RL algorithms in sophisticated driving environments.
2. **Application of World Models**: CarDreamer integrates world models such as DreamerV2, DreamerV3, and Planning2Explore, which simplifies the reproduction of RL algorithms. The backbone is decoupled from the rest of the platform and communicates through the standard Gym interface, so users can integrate and test their own algorithms.
3. **Built-in and Custom Tasks**: The platform provides a set of configurable built-in tasks with empirically tuned reward functions, plus a task development suite for defining traffic flows and vehicle routes and for collecting multi-modal observation data automatically. Built-in tasks include lane changing and overtaking, among others, and support multiple observation modalities (such as camera and LiDAR) and different levels of observability.
4. **Visualization and Experimental Results**: A visualization server displays agent driving videos and performance metrics in real time through a browser, which speeds up reward engineering and algorithm development. The paper also reports experiments on the built-in tasks that evaluate prediction accuracy under different observation modalities and study how observability and the sharing of vehicle intentions affect traffic safety and efficiency.

Through this work, CarDreamer aims to fill the gap left by existing platforms and to provide tooling for world model based autonomous driving research.
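
To make the "standard Gym interface" point concrete, the sketch below shows the classic Gym-style `reset`/`step` contract that lets an algorithm backbone stay decoupled from the task. It is only an illustration under stated assumptions: `ToyLaneEnv`, `run_episode`, and `proportional_policy` are hypothetical names invented here, the dynamics are a toy one-dimensional lane-offset model, and none of this is CarDreamer's actual API or task code.

```python
import random


class ToyLaneEnv:
    """Hypothetical toy task following the classic Gym reset/step convention.

    Not part of CarDreamer: the state is a single lateral offset from the
    lane center, and the action is a steering correction in [-1, 1].
    """

    def __init__(self, horizon=20, seed=0):
        self.horizon = horizon
        self.rng = random.Random(seed)
        self.offset = 0.0
        self.t = 0

    def reset(self):
        # Start each episode at a random offset from the lane center.
        self.offset = self.rng.uniform(-1.0, 1.0)
        self.t = 0
        return {"lateral_offset": self.offset}

    def step(self, action):
        # Clip the action, apply it, and return (obs, reward, done, info).
        action = max(-1.0, min(1.0, action))
        self.offset += 0.5 * action
        self.t += 1
        reward = -abs(self.offset)  # closer to the center is better
        done = self.t >= self.horizon or abs(self.offset) > 2.0
        return {"lateral_offset": self.offset}, reward, done, {"step": self.t}


def run_episode(env, policy):
    """Generic loop: works with any object exposing reset() and step()."""
    obs = env.reset()
    total_reward, steps, done = 0.0, 0, False
    while not done:
        obs, reward, done, _info = env.step(policy(obs))
        total_reward += reward
        steps += 1
    return total_reward, steps


def proportional_policy(obs):
    # Stand-in for a learned agent: steer back toward the lane center.
    return -obs["lateral_offset"]


if __name__ == "__main__":
    total, steps = run_episode(ToyLaneEnv(), proportional_policy)
    print(f"steps={steps} total_reward={total:.3f}")
```

Because `run_episode` only relies on `reset` and `step`, the policy could be swapped for a world model based agent, or the environment for a different task, without touching the loop. That is the kind of decoupling the abstract attributes to the Gym interface.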