Abstract: To address the vehicle-model tracking error and over-dependence problems in traditional path planning for intelligent driving vehicles, a path planning method based on deep reinforcement learning is proposed. First, an abstract model of the real environment is extracted. The model combines a deep reinforcement learning end-to-end strategy (DRL-ETE) with a vehicle dynamics model to train a reinforcement learning model that approximates optimal intelligent driving. Second, the real-scene problem is transferred to the virtual abstract model through a model transfer strategy, and the control and trajectory sequences are computed in that environment with the trained deep reinforcement learning model. Finally, the optimal trajectory sequence is selected in the real environment according to an evaluation function. The experience replay mechanism of the Deep Q-Network (DQN) algorithm stores samples in first-in, first-out (FIFO) order and later draws them for training by uniform sampling, so experience replay is inefficient. These two problems slow the vehicle's search for the target and the route. In addition, the greedy strategy leaves the explored map information incomplete. To address these issues, the IDQNPER algorithm model is proposed. When samples are stored, they are assigned weights and fed to the network for training in priority order. Meanwhile, important data sequences are retained in the experience replay buffer, and highly similar sequences are removed. The total reward is about 10% higher than that of the original Deep Q-Network, which indicates that the intelligent driving vehicle reaches target points more accurately.
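The replay-buffer changes described above (weighted storage, priority-driven training, removal of highly similar sequences) can be illustrated with a minimal sketch. This is not the IDQNPER implementation. The class name, the use of |TD error| as the weight, the max-absolute-difference similarity test, and proportional sampling in place of strict priority order are all assumptions made for illustration.

```python
import random


class PrioritizedReplayBuffer:
    """Illustrative buffer: priority-weighted sampling, non-FIFO eviction."""

    def __init__(self, capacity, alpha=0.6, similarity_tol=1e-3, seed=None):
        self.capacity = capacity
        self.alpha = alpha                    # 0 gives uniform sampling
        self.similarity_tol = similarity_tol  # assumed similarity threshold
        self.items = []                       # list of (priority, transition)
        self.rng = random.Random(seed)

    @staticmethod
    def _distance(a, b):
        # Assumed similarity measure: largest per-component state difference.
        return max(abs(x - y) for x, y in zip(a, b))

    def add(self, state, action, reward, next_state, td_error):
        # Assumed weight: magnitude of the TD error, kept strictly positive.
        priority = abs(td_error) + 1e-6
        transition = (state, action, reward, next_state)

        # Near-duplicate of a stored sample: keep only the higher-priority one.
        for i, (p, (s, a, _, _)) in enumerate(self.items):
            if a == action and self._distance(s, state) <= self.similarity_tol:
                if priority > p:
                    self.items[i] = (priority, transition)
                return

        # Buffer full: evict the least important sample, not the oldest.
        if len(self.items) >= self.capacity:
            lowest = min(range(len(self.items)), key=lambda i: self.items[i][0])
            if self.items[lowest][0] >= priority:
                return  # the new sample is the least important; discard it
            self.items.pop(lowest)

        self.items.append((priority, transition))

    def sample(self, batch_size):
        # Draw with probability proportional to priority ** alpha.
        weights = [p ** self.alpha for p, _ in self.items]
        transitions = [t for _, t in self.items]
        return self.rng.choices(transitions, weights=weights, k=batch_size)
```

Compared with a FIFO buffer and uniform sampling, this variant keeps high-error transitions longer and replays them more often. A full prioritized-replay setup would normally also correct the resulting sampling bias with importance-sampling weights, which this sketch omits.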
To further realize autonomous decision-making for intelligent driving vehicles and to reduce the heavy reliance on map information in the traditional human planning framework, an end-to-end path planning method based on deep reinforcement learning theory is proposed. It maps sensor information directly to action instructions, which are then issued to the vehicle. First, a CNN and an LSTM are used to process radar and camera information. Then, after comparing the advantages of the DQN, Double DQN, Dueling DQN, and PER (prioritized experience replay) algorithms, the IDQNPER algorithm is used to train automatic path planning for intelligent driving vehicles. Finally, simulation and verification experiments are carried out in a static obstacle environment. The test results show that the IDQNPER algorithm adapts to intelligent vehicles in different environments. The method handles continuous input states and generates a continuous control sequence for steering-angle control, which reduces the lateral tracking error. In addition, experience replay improves the generalization performance of the model and reduces the over-dependence problem.
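For readers comparing the algorithms named above, the following sketch shows the standard textbook forms of the DQN target, the Double DQN target, and the Dueling DQN aggregation. It does not show how IDQNPER combines them, which the abstract does not specify. The function names and the plain-list representation of Q-values are assumptions for illustration.

```python
def dqn_target(reward, next_q_target, gamma=0.99, done=False):
    """DQN: the target network both selects and evaluates the next action."""
    if done:
        return reward
    return reward + gamma * max(next_q_target)


def double_dqn_target(reward, next_q_online, next_q_target, gamma=0.99, done=False):
    """Double DQN: the online network selects, the target network evaluates."""
    if done:
        return reward
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    return reward + gamma * next_q_target[best_action]


def dueling_q_values(state_value, advantages):
    """Dueling DQN: Q(s, a) = V(s) + A(s, a) - mean over a' of A(s, a')."""
    mean_advantage = sum(advantages) / len(advantages)
    return [state_value + adv - mean_advantage for adv in advantages]
```

Decoupling action selection from evaluation is what lets Double DQN reduce the overestimation that the `max` in the plain DQN target tends to introduce. The dueling form separates how good a state is from how much each action matters in it.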
