Abstract: To address the vehicle-model tracking error and the over-reliance on that model in traditional path planning for intelligent driving vehicles, a path planning method based on deep reinforcement learning is proposed. First, an abstract model of the real environment is extracted. This model combines a deep reinforcement learning end-to-end strategy (DRL-ETE) with a vehicle dynamics model to train a reinforcement learning model that approximates optimal intelligent driving. Second, the real-scene problem is transferred to the virtual abstract model through a model transfer strategy, and the control and trajectory sequences are computed in that environment using the trained deep reinforcement learning model. Finally, the optimal trajectory sequence is selected in the real environment according to an evaluation function. The experience replay mechanism of the Deep Q-Network (DQN) algorithm stores samples in first-in, first-out (FIFO) order and draws them by uniform sampling for later replay training, so experience replay is inefficient. These two problems slow down the process by which the intelligent driving vehicle reaches its target and finds a route. In addition, the greedy strategy leaves the explored map information incomplete. To address these issues, the IDQNPER algorithm model is proposed. When samples are stored, each is assigned a weight, and samples are sent to the network for training in priority order. Meanwhile, important data sequences are retained in the experience replay buffer, and sequences with high similarity are removed. The total reward is about 10% higher than that of the original Deep Q-Network, showing that the intelligent driving vehicle approaches target points more accurately.
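The storage and sampling idea described above can be illustrated with a minimal sketch. It is not the IDQNPER implementation: the abstract does not specify the weighting rule or the similarity measure, so the sketch assumes that the weight is the absolute TD error, that sampling probability is proportional to the weight raised to a power `alpha`, and that two samples are "similar" when they share an action and their state vectors lie within a small Euclidean distance. The class name, parameters, and thresholds are all hypothetical.

```python
import math
import random


class PrioritizedReplayBuffer:
    """Illustrative buffer: priority-weighted sampling plus near-duplicate pruning."""

    def __init__(self, capacity, alpha=0.6, sim_threshold=1e-3, seed=0):
        self.capacity = capacity
        self.alpha = alpha                  # how strongly priority skews sampling (assumed)
        self.sim_threshold = sim_threshold  # assumed distance below which states count as similar
        self.items = []                     # list of (priority, (state, action, reward, next_state))
        self.rng = random.Random(seed)

    def add(self, state, action, reward, next_state, td_error):
        priority = abs(td_error) + 1e-6     # assumed weighting rule: |TD error|
        transition = (state, action, reward, next_state)
        # Near-duplicate check: keep only the more important of two similar samples.
        for i, (p, (s, a, _, _)) in enumerate(self.items):
            if a == action and math.dist(s, state) < self.sim_threshold:
                if priority > p:
                    self.items[i] = (priority, transition)
                return
        if len(self.items) >= self.capacity:
            # Evict the least important sample instead of the oldest one (contrast with FIFO).
            lowest = min(range(len(self.items)), key=lambda i: self.items[i][0])
            if self.items[lowest][0] >= priority:
                return                      # new sample is less important than everything stored
            self.items.pop(lowest)
        self.items.append((priority, transition))

    def sample(self, batch_size):
        # Draw with probability proportional to priority ** alpha (with replacement).
        weights = [p ** self.alpha for p, _ in self.items]
        transitions = [t for _, t in self.items]
        return self.rng.choices(transitions, weights=weights, k=batch_size)
```

With `alpha = 0` the sampler reduces to uniform sampling, which is the baseline behaviour the abstract identifies as inefficient.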
To further realize autonomous decision-making for intelligent driving vehicles and to reduce the excessive reliance on map information in the traditional human-designed planning framework, an end-to-end path planning method based on deep reinforcement learning theory is proposed, which maps sensor information directly to action instructions and issues them to the intelligent driving vehicle. First, a CNN and an LSTM are used to process radar and camera information. After comparing the advantages of the DQN, Double DQN, Dueling DQN and PER algorithms, the IDQNPER algorithm is used to train automatic path planning for intelligent driving vehicles. Finally, simulation and verification experiments are carried out in a static obstacle environment. The test results show that the IDQNPER algorithm adapts to intelligent vehicles in different environments. The method handles continuous input states and generates a continuous control sequence for steering-angle control, which reduces the lateral tracking error. At the same time, experience replay improves the generalization performance of the model and reduces the over-reliance problem.
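For context on the algorithms being compared, the following sketch contrasts the standard one-step learning targets of DQN and Double DQN. It shows only these two well-known targets; how IDQNPER combines them with prioritized replay is not stated in the abstract, and the function names and the default discount factor here are illustrative assumptions.

```python
def dqn_target(reward, next_q_target, gamma=0.99, done=False):
    """Standard DQN target: the target network both selects and evaluates the next action."""
    if done:
        return reward
    return reward + gamma * max(next_q_target)


def double_dqn_target(reward, next_q_online, next_q_target, gamma=0.99, done=False):
    """Double DQN target: the online network selects the action, the target network evaluates it."""
    if done:
        return reward
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    return reward + gamma * next_q_target[best_action]
```

Decoupling action selection from action evaluation is what lets Double DQN reduce the overestimation of Q-values that the single `max` in the DQN target tends to produce.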
