Abstract: Deep-learning-based techniques have been widely adopted for autonomous driving software stacks for mass production in recent years, focusing primarily on perception modules, with some work extending this method to prediction modules. However, the downstream planning and control modules are still designed with hefty handcrafted rules, dominated by optimization-based methods such as quadratic programming or model predictive control. This results in a performance bottleneck for autonomous driving systems, in that corner cases simply cannot be solved by enumerating hand-crafted rules. We present a deep-learning-based approach that brings prediction, decision, and planning modules together in an attempt to overcome the rule-based methods' deficiency in real-world applications of autonomous driving, especially for urban scenes. The DNN model we propose is trained solely on 10 hours of human driver data, and it supports all mass-production ADAS features available on the market to date. This method is deployed onto a Jiyue test car with no modification to its factory-ready sensor set and compute platform. The feasibility, usability, and commercial potential are demonstrated in this article.

What problem does this paper attempt to address?

This paper discusses how to use generative learning methods to integrate the prediction, decision-making, and planning modules of autonomous driving, so as to overcome the limitations of rule-based methods, especially in urban driving scenarios where high-definition (HD) maps are unavailable. Current autonomous driving systems typically take an engineering approach that divides the task into sub-tasks such as perception, prediction, planning, and control. However, the downstream planning and control modules still rely on manually designed rules, which become a performance bottleneck in corner cases. The paper proposes a deep neural network (DNN) model that is trained on only 10 hours of human driver data and supports all advanced driver assistance system (ADAS) functions currently on the market. The model is deployed on a Jiyue test vehicle with an unmodified factory sensor set and computing platform, demonstrating its practicality and commercial potential.

To overcome the limitations of rule-based planning modules, the paper proposes an interpretable and adjustable framework consisting of a trajectory generator and an evaluator, which together handle prediction and planning from a data-driven perspective. The generator uses vectorized embeddings to generate trajectories, while the evaluator uses gridded non-parametric information to validate and score candidate trajectories. This approach is validated in real-world closed-loop tests and performs comparably to optimization-based off-the-shelf planners. The paper also describes how to reconstruct local topology using landmarks and occupancy grids to support motion planning in urban environments. Although the initial focus is on daily commuting, the method is also applicable to other route formats.
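The generator and evaluator split described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the function names, the grid layout, the scoring rule, and above all the generator, which here is a simple fan of constant-curvature arcs standing in for the learned model. The evaluator checks each candidate against a boolean occupancy grid and scores the survivors by forward progress.

```python
import math

def generate_candidates(n=5, horizon=20, step=0.5, max_curvature=0.2):
    """Stand-in generator (assumption): a fan of constant-curvature arcs in
    the ego frame. The paper's generator is a learned model, not this."""
    candidates = []
    for i in range(n):
        kappa = -max_curvature + 2 * max_curvature * i / (n - 1)
        x = y = heading = 0.0
        traj = []
        for _ in range(horizon):
            heading += kappa * step
            x += step * math.cos(heading)
            y += step * math.sin(heading)
            traj.append((x, y))
        candidates.append(traj)
    return candidates

def score(traj, grid, resolution=0.5, origin=(0.0, -5.0)):
    """Hypothetical evaluator: reject a trajectory that touches an occupied
    cell or leaves the grid, otherwise reward forward progress (final x)."""
    rows, cols = len(grid), len(grid[0])
    for x, y in traj:
        col = math.floor((x - origin[0]) / resolution)
        row = math.floor((y - origin[1]) / resolution)
        if not (0 <= row < rows and 0 <= col < cols):
            return float("-inf")  # outside the observed area
        if grid[row][col]:
            return float("-inf")  # collides with an occupied cell
    return traj[-1][0]

def select_best(candidates, grid):
    """Return (index, score) of the highest-scoring candidate."""
    best_score, best_idx = max((score(t, grid), i) for i, t in enumerate(candidates))
    return best_idx, best_score

# 20 x 24 grid at 0.5 m per cell: y in [-5, 5), x in [0, 12).
grid = [[False] * 24 for _ in range(20)]
for col in range(10, 13):
    grid[10][col] = True  # obstacle straight ahead, about 5 m away

candidates = generate_candidates()
best_idx, best_score = select_best(candidates, grid)
print(best_idx, round(best_score, 2))
```

In this toy setup the straight-ahead candidate is rejected because it crosses the obstacle, the two sharpest arcs are rejected because they leave the grid, and one of the gentle arcs is selected. A learned evaluator would replace the hard rejection rule with a score derived from data.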
Through real-world closed-loop tests in complex urban environments, the paper emphasizes the importance of closed-loop evaluation, which reflects the system's performance more accurately than offline imitation metrics alone. In summary, the main contributions of this paper are:
1. Proposing a data-driven prediction and planning framework that is scalable, adjustable, and does not require HD maps.
2. Evaluating this system for the first time on complex, real-world urban roads using factory-ready sensor setups and computing platforms.
3. Extending data-driven methods by proposing a generative learning approach that combines maximum margin planning and multimodal imitation to improve motion planning performance.

GAD-Generative Learning for HD Map-Free Autonomous Driving

Driving Behaviour Style Study with a Hybrid Deep Learning Framework Based on GPS Data

GenAD: Generalized Predictive Model for Autonomous Driving

GenAD: Generative End-to-End Autonomous Driving

Planning-oriented Autonomous Driving

DeepGoal: Learning to drive with driving intention from human control demonstration

DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving

Adaptive Visual Interaction Based Multi-Target Future State Prediction For Autonomous Driving Vehicles

Humanlike Driving: Empirical Decision-Making System for Autonomous Vehicles

DQ-GAT: Towards Safe and Efficient Autonomous Driving With Deep Q-Learning and Graph Attention Networks

Efficient and Generalized End-to-end Autonomous Driving System with Latent Deep Reinforcement Learning and Demonstrations

ADriver-I: A General World Model for Autonomous Driving

HE-Drive: Human-Like End-to-End Driving with Vision Language Models

Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline

Neural HD Map Generation from Multiple Vectorized Tiles Locally Produced by Autonomous Vehicles

A Cognitive-Based Trajectory Prediction Approach for Autonomous Driving

Deep learning-based vehicle trajectory prediction based on generative adversarial network for autonomous driving applications

Automatic Building and Labeling of HD Maps with Deep Learning