Abstract:Path planning algorithm has always been the core of intelligent robot research; a good path planning algorithm can significantly enhance the efficiency of robots in executing tasks. As the application scenarios for intelligent robots continue to diversify, their adaptability to the environment has become a key focus in current path planning algorithm research. As one of the classic reinforcement learning algorithms, Q-learning (QL) algorithm has its inherent advantages in adapting to the environment, but it also faces various challenges and shortcomings. These issues are primarily centered around suboptimal path planning, slow convergence speed, weak generalization capability and poor obstacle avoidance performance. In order to solve these issues in the QL algorithm, we have carried out the following work. (1) We redesign the reward mechanism of QL algorithm. The traditional Q-learning algorithm's reward mechanism is simple to implement but lacks directionality. We propose a combined reward mechanism of "static assignment + dynamic adjustment." This mechanism can address the issue of random path selection and ultimately lead to optimal path planning. (2) We redesign the greedy strategy of QL algorithm. In the traditional Q-learning algorithm, the greedy factor in the strategy is either randomly generated or set manually, which limits its applicability to some extent. It is difficult to effectively applied to different physical environments and scenarios, which is the fundamental reason for the poor generalization capability of the algorithm. We propose a dynamic adjustment of the greedy factor, known as the greedy strategy, which significantly improves the efficiency of Q-learning algorithm and enhances its generalization capability so that the algorithm has a wider range of application scenarios. (3) We introduce a concept to enhance the algorithm's obstacle avoidance performance. We design the expansion distance, which pre-sets a "collision buffer" between the obstacle and agent to enhance the algorithm's obstacle avoidance performance.

Agent Maze Path Planning Based on Simulated Annealing Q-Learning Algorithm

Reinforcement learning path planning algorithm based on obstacle area expansion strategy

A Distributed Path Planning Algorithm via Reinforcement Learning

ETQ-learning: an improved Q-learning algorithm for path planning

An optimized Q-Learning algorithm for mobile robot local path planning

Indoor Emergency Path Planning Based on the Q-Learning Optimization Algorithm

Biologically Inspired Complete Coverage Path Planning Algorithm Based on Q-Learning

Path Planning of Autonomous Mobile Robot in Comprehensive Unknown Environment Using Deep Reinforcement Learning

Path Planning Method With Improved Artificial Potential Field—A Reinforcement Learning Perspective

Optimal Exploration Algorithm of Multi-Agent Reinforcement Learning Methods (Student Abstract)

An Improved Dyna-Q Algorithm Inspired by the Forward Prediction Mechanism in the Rat Brain for Mobile Robot Path Planning

A Path-Planning Method Based on Improved Soft Actor-Critic Algorithm for Mobile Robots

A Path-Planning Approach Based on Potential and Dynamic Q-Learning for Mobile Robots in Unknown Environment

Multi-robot dynamic path planning with priority based on simulated annealing

A path planning approach for mobile robots using short and safe Q-learning.

Dynamic path planning of mobile robot based on improved simulated annealing algorithm

Robot path planner based on deep reinforcement learning and the seeker optimization algorithm

Learning Efficient Multi-Agent Cooperative Visual Exploration

Robot Path Planning Based on Artificial Potential Field Approach with Simulated Annealing

Reinforcement Learning-Based Path Planning Algorithm for Mobile Robots

Asynchronous reinforcement learning algorithms for solving discrete space path planning problems