Abstract: To address the tracking error and over-reliance on the vehicle model in traditional path planning for intelligent driving vehicles, a path planning method based on deep reinforcement learning is proposed. First, an abstract model of the real environment is extracted. The model combines a deep reinforcement learning end-to-end strategy (DRL-ETE) with a vehicle dynamics model to train a reinforcement learning model that approximates optimal intelligent driving. Second, the real-scene problem is transferred to the virtual abstract model through a model transfer strategy, and the control and trajectory sequences are computed in that environment with the trained deep reinforcement learning model. Finally, the optimal trajectory sequence is selected in the real environment according to an evaluation function. The experience replay mechanism of the Deep Q-Network (DQN) algorithm stores samples first-in, first-out (FIFO) and draws them uniformly at random for later training, so experience replay is inefficient. These two problems slow the vehicle's progress toward the target and its route finding. In addition, the greedy strategy leaves the explored map information incomplete. To address these issues, the IDQNPER algorithm model is proposed. When samples are stored, each is assigned a weight, and samples are fed to the network for training in priority order. Meanwhile, important data sequences are retained in the experience replay buffer, and sequences with high similarity are removed. The total reward is about 10% higher than that of the original Deep Q-Network, which indicates that the intelligent driving vehicle reaches target points more accurately.
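The replay changes described above (weighted samples, priority-driven training, removal of highly similar sequences) can be illustrated with a small sketch. This is not the IDQNPER implementation, which the abstract does not specify. It assumes proportional prioritization in the style of standard prioritized experience replay, uses the absolute TD error as the weight, and uses a squared-distance check as a stand-in similarity test. The names `Transition`, `PrioritizedReplay`, `alpha`, and `sim_threshold` are hypothetical.

```python
import random
from dataclasses import dataclass


@dataclass
class Transition:
    state: tuple
    action: int
    reward: float
    next_state: tuple
    done: bool


class PrioritizedReplay:
    """Toy buffer: priority-weighted sampling plus near-duplicate rejection."""

    def __init__(self, capacity, alpha=0.6, sim_threshold=1e-3, seed=0):
        self.capacity = capacity
        self.alpha = alpha                  # assumed: how strongly priority skews sampling
        self.sim_threshold = sim_threshold  # assumed: squared-distance cutoff for "similar"
        self.items = []                     # stored transitions
        self.priorities = []                # one priority per stored transition
        self.rng = random.Random(seed)

    def _is_near_duplicate(self, t):
        # Hypothetical similarity test: same action and an almost identical state.
        for other in self.items:
            if other.action == t.action:
                dist = sum((a - b) ** 2 for a, b in zip(other.state, t.state))
                if dist < self.sim_threshold:
                    return True
        return False

    def add(self, t, td_error):
        """Store t unless it is a near-duplicate; return True if it was stored."""
        if self._is_near_duplicate(t):
            return False
        priority = (abs(td_error) + 1e-6) ** self.alpha
        if len(self.items) >= self.capacity:
            # Evict the lowest-priority stored sample rather than the oldest (FIFO).
            idx = min(range(len(self.priorities)), key=self.priorities.__getitem__)
            self.items.pop(idx)
            self.priorities.pop(idx)
        self.items.append(t)
        self.priorities.append(priority)
        return True

    def sample(self, batch_size):
        # Sampling probability is proportional to the stored priority.
        return self.rng.choices(self.items, weights=self.priorities, k=batch_size)
```

Compared with a FIFO buffer sampled uniformly, this sketch keeps high-error transitions longer and replays them more often. A full prioritized replay scheme would normally also apply importance-sampling corrections, which are omitted here.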
To further realize autonomous decision-making for intelligent driving vehicles and to reduce the heavy reliance on map information in traditional manually designed planning frameworks, an end-to-end path planning method based on deep reinforcement learning theory is proposed. It maps sensor information directly to action instructions, which are then issued to the vehicle. First, a CNN and an LSTM are used to process radar and camera information. After comparing the advantages of the DQN, Double DQN, Dueling DQN, and PER algorithms, the IDQNPER algorithm is used to train automatic path planning for intelligent driving vehicles. Finally, simulation and verification experiments are carried out in a static obstacle environment. The test results show that the IDQNPER algorithm adapts to intelligent vehicles in different environments. The method can handle continuous input states and generate a continuous control sequence for steering angle control, which reduces the lateral tracking error. At the same time, experience replay improves the generalization performance of the model and reduces the overdependence problem.
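As background for the comparison of DQN, Double DQN, and Dueling DQN mentioned above, the following sketch shows the textbook definitions of the two bootstrap targets and the dueling aggregation on plain Python lists. It does not reproduce the paper's networks or hyperparameters. The function names and the toy Q-values in the usage note are illustrative assumptions.

```python
def dqn_target(reward, done, gamma, q_target_next):
    """Standard DQN target: bootstrap from the max target-network Q-value."""
    if done:
        return reward
    return reward + gamma * max(q_target_next)


def double_dqn_target(reward, done, gamma, q_online_next, q_target_next):
    """Double DQN target: the online network picks the action,
    the target network evaluates it."""
    if done:
        return reward
    best = max(range(len(q_online_next)), key=q_online_next.__getitem__)
    return reward + gamma * q_target_next[best]


def dueling_q_values(state_value, advantages):
    """Dueling aggregation: Q(s, a) = V(s) + A(s, a) - mean(A(s, .))."""
    mean_adv = sum(advantages) / len(advantages)
    return [state_value + a - mean_adv for a in advantages]
```

For example, with reward 1.0, discount 0.5, online Q-values `[1.0, 3.0, 2.0]`, and target Q-values `[4.0, 0.5, 2.0]`, `dqn_target` returns 3.0 while `double_dqn_target` returns 1.25. The difference shows how decoupling action selection from evaluation can temper the overestimation that arises from taking a max over noisy estimates.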
