Abstract:The autonomous driving technology based on deep reinforcement learning (DRL) has been confirmed as one of the most cutting-edge research fields worldwide. The agent is enabled to achieve the goal of making independent decisions by interacting with the environment and learning driving strategies based on the feedback from the environment. This technology has been widely used in end-to-end driving tasks. However, this field faces several challenges. First, developing real vehicles is expensive, time-consuming, and risky. To further expedite the testing, verification, and iteration of end-to-end deep reinforcement learning algorithms, a joint simulation development and validation platform was designed and implemented in this study based on VTD–CarSim and the Tensorflow deep learning framework, and research work was conducted based on this platform. Second, sparse reward signals can cause problems (e.g., a low-sample learning rate). It is imperative for the agent to be capable of navigating in an unfamiliar environment and driving safely under a wide variety of weather or lighting conditions. To address the problem of poor generalization ability of the agent to unknown scenarios, a deep deterministic policy gradient (DDPG) decision-making and planning method was proposed in this study in accordance with a multi-task fusion strategy. The main task based on DRL decision-making planning and the auxiliary task based on image semantic segmentation were cross-fused, and part of the network was shared with the main task to reduce the possibility of model overfitting and improve the generalization ability. As indicated by the experimental results, first, the joint simulation development and validation platform built in this study exhibited prominent versatility. Users were enabled to easily substitute any default module with customized algorithms and verify the effectiveness of new functions in enhancing overall performance using other default modules of the platform. Second, the deep reinforcement learning strategy based on multi-task fusion proposed in this study was competitive. Its performance was better than other DRL algorithms in certain tasks, which improved the generalization ability of the vehicle decision-making planning algorithm.

Unsupervised Reinforcement Learning for Multi-Task Autonomous Driving: Expanding Skills and Cultivating Curiosity

Unsupervised Discovery of Transitional Skills for Deep Reinforcement Learning

Learning an Efficient and Safe Policy for Highway Driving Using Supervised Learning and Reinforcement Learning.

Multi-Input Autonomous Driving Based on Deep Reinforcement Learning with Double Bias Experience Replay

Task-Driven Autonomous Driving: Balanced Strategies Integrating Curriculum Reinforcement Learning and Residual Policy

CIRL: Controllable Imitative Reinforcement Learning for Vision-Based Self-driving

A Multi-Task Fusion Strategy-Based Decision-Making and Planning Method for Autonomous Driving Vehicles

Accelerating Reinforcement Learning for Autonomous Driving using Task-Agnostic and Ego-Centric Motion Skills

Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors

Improved Deep Reinforcement Learning with Expert Demonstrations for Urban Autonomous Driving

Multi-Task Long-Range Urban Driving Based on Hierarchical Planning and Reinforcement Learning

A Multi-sensing Input and Multi-constraint Reward Mechanism Based Deep Reinforcement Learning Method for Self-driving Policy Learning

Understanding the Complexity Gains of Contextual Multi-task RL with Curricula

Towards Autonomous Driving Decision by Combining Self-attention and Deep Reinforcement Learning.

Continuous Reinforcement Learning From Human Demonstrations With Integrated Experience Replay For Autonomous Driving

Deep Reinforcement Learning on Autonomous Driving Policy With Auxiliary Critic Network

Multi-objective Optimization Based Deep Reinforcement Learning for Autonomous Driving Policy

Efficient Deep Reinforcement Learning with Imitative Expert Priors for Autonomous Driving

Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey

Urban Driving with Multi-Objective Deep Reinforcement Learning

MRIC: Model-Based Reinforcement-Imitation Learning with Mixture-of-Codebooks for Autonomous Driving Simulation