GAD-Generative Learning for HD Map-Free Autonomous Driving

Weijian Sun,Yanbo Jia,Qi Zeng,Zihao Liu,Jiang Liao,Yue Li,Xianfeng Li
2024-06-01
Abstract:Deep-learning-based techniques have been widely adopted for autonomous driving software stacks for mass production in recent years, focusing primarily on perception modules, with some work extending this method to prediction modules. However, the downstream planning and control modules are still designed with hefty handcrafted rules, dominated by optimization-based methods such as quadratic programming or model predictive control. This results in a performance bottleneck for autonomous driving systems in that corner cases simply cannot be solved by enumerating hand-crafted rules. We present a deep-learning-based approach that brings prediction, decision, and planning modules together with the attempt to overcome the rule-based methods' deficiency in real-world applications of autonomous driving, especially for urban scenes. The DNN model we proposed is solely trained with 10 hours of human driver data, and it supports all mass-production ADAS features available on the market to date. This method is deployed onto a Jiyue test car with no modification to its factory-ready sensor set and compute platform. the feasibility, usability, and commercial potential are demonstrated in this article.
Robotics,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper mainly discusses how to use generative learning methods to integrate the prediction, decision-making, and planning modules of autonomous driving to overcome the limitations of rule-based methods, especially in urban driving scenarios, when high-precision maps (HD maps) are unavailable. Current autonomous driving systems typically employ an engineering approach by dividing the tasks into sub-tasks such as perception, prediction, planning, and control. However, the downstream planning and control modules still rely on manually designed rules, which becomes a performance bottleneck when dealing with extreme situations. The paper proposes a deep learning (DNN) model that only uses 10 hours of human driver data for training and can support all existing advanced driver assistance system (ADAS) functions on the market. The model is deployed on Jiye test vehicles with unmodified factory sensor sets and computing platforms, demonstrating its practicality and commercial potential. To overcome the limitations of rule-based planning modules, the paper proposes an interpretable and adjustable framework that includes a trajectory generator and an evaluator, which comprehensively handles prediction and planning tasks from a data-driven perspective. The generator uses vectorized embeddings to generate trajectories, while the evaluator uses gridded non-parametric information to validate and score candidate trajectories. This approach is validated in actual closed-loop tests and performs comparably to optimization-based off-the-shelf planners. In addition, the paper also describes how to reconstruct local topology using landmarks and occupied grid cells to support motion planning in urban environments. Although the initial focus is on daily commuting, the method is also applicable to various route formats. Through real-world closed-loop tests in complex urban environments, the paper emphasizes the importance of closed-loop evaluation as it can more accurately reflect the system's performance, not just offline imitation metrics. In summary, the main contributions of this paper are: 1. Proposing a data-driven prediction and planning framework that is scalable, adjustable, and does not require high-definition maps. 2. Evaluating this system for the first time on complex, real-world urban roads using factory-ready sensor setups and computing platforms. 3. Expanding data-driven methods by proposing a generative learning approach that combines maximum border planning and multimodal imitation to improve motion planning performance.