Abstract:The autonomous driving technology based on deep reinforcement learning (DRL) has been confirmed as one of the most cutting-edge research fields worldwide. The agent is enabled to achieve the goal of making independent decisions by interacting with the environment and learning driving strategies based on the feedback from the environment. This technology has been widely used in end-to-end driving tasks. However, this field faces several challenges. First, developing real vehicles is expensive, time-consuming, and risky. To further expedite the testing, verification, and iteration of end-to-end deep reinforcement learning algorithms, a joint simulation development and validation platform was designed and implemented in this study based on VTD–CarSim and the Tensorflow deep learning framework, and research work was conducted based on this platform. Second, sparse reward signals can cause problems (e.g., a low-sample learning rate). It is imperative for the agent to be capable of navigating in an unfamiliar environment and driving safely under a wide variety of weather or lighting conditions. To address the problem of poor generalization ability of the agent to unknown scenarios, a deep deterministic policy gradient (DDPG) decision-making and planning method was proposed in this study in accordance with a multi-task fusion strategy. The main task based on DRL decision-making planning and the auxiliary task based on image semantic segmentation were cross-fused, and part of the network was shared with the main task to reduce the possibility of model overfitting and improve the generalization ability. As indicated by the experimental results, first, the joint simulation development and validation platform built in this study exhibited prominent versatility. Users were enabled to easily substitute any default module with customized algorithms and verify the effectiveness of new functions in enhancing overall performance using other default modules of the platform. Second, the deep reinforcement learning strategy based on multi-task fusion proposed in this study was competitive. Its performance was better than other DRL algorithms in certain tasks, which improved the generalization ability of the vehicle decision-making planning algorithm.

Research on Deep Reinforcement Learning Algorithm Based on Dynamic Fusion Target

Deep Reinforcement Learning with Sarsa and Q-Learning: A Hybrid Approach

A Dynamic Adjusting Reward Function Method for Deep Reinforcement Learning with Adjustable Parameters

Deep Reinforcement Learning of the Model Fusion with Double Q-learning

Cooperative multi-agent target searching: a deep reinforcement learning approach based on parallel hindsight experience replay

Research on Autonomous Obstacle Avoidance and Target Tracking of UAV Based on Improved Dueling DQN Algorithm

Dynamic Weapon Target Assignment Based on Deep Q Network

Dynamic fusion for ensemble of deep Q-network

A Two-Stage Target Search and Tracking Method for UAV Based on Deep Reinforcement Learning

Based on Doubly Decoupled Reinforced Network

Path Planning in an Unknown Environment Based on Deep Reinforcement Learning with Prior Knowledge

Deep Reinforcement Learning for the Improvement of Robot Manipulation Skills under Sparse Reward

DHQN - a Stable Approach to Remove Target Network from Deep Q-learning Network.

Study on the Path Planning Algorithm Based on Dueling Deep Q Network

A TD3-based multi-agent deep reinforcement learning method in mixed cooperation-competition environment

Improved SARSA and DQN algorithms for reinforcement learning

Path Planning for Mobile Robot Based on Deep Reinforcement Learning and Fuzzy Control

Research on Path Planning of Mobile Robot Based on Improved Deep Q Network

UAV Autonomous Trajectory Planning in Target Tracking Tasks Via a DQN Approach.

A Multi-Task Fusion Strategy-Based Decision-Making and Planning Method for Autonomous Driving Vehicles

UAV Air Game Maneuver Decision-Making Using Dueling Double Deep Q Network with Expert Experience Storage Mechanism