Abstract:Automatic guided vehicles have become an important part of transporting goods in dynamic environments, and how to design an efficient path planning method for multiple AGVs is a current research hotspot. Due to the complex road conditions in dynamic environments, there may be dynamic obstacles and situations in which only the target point is known but a complete map is lacking, which leads to poor path planning and long planning time for multiple automatic guided vehicles (AGVs). In this paper, a two-level path planning method (referred to as GA-KL, genetic KL method) for multi-AGVs is proposed by integrating the scheduling policy into global path planning and combining the global path planning algorithm and local path planning algorithm. First, for local path planning, we propose an improved Q-learning path optimization algorithm (K-L, Kohonen Q-learning algorithm) based on a Kohonen network, which can avoid dynamic obstacles and complete autonomous path finding using the autonomous learning function of the Q-learning algorithm. Then, we adopt the idea of combining global and local planning by combining the K-L algorithm with the improved genetic algorithm; in addition, we integrate the scheduling policy into global path planning, which can continuously adjust the scheduling policy of multi-AGVs according to changes in the dynamic environment. Finally, through simulation and field experiments, we verified that the K-L algorithm can accomplish autonomous path finding; compared with the traditional path planning algorithm, the algorithm achieved improves results in path length and convergence time with various maps; the convergence time of the algorithm was reduced by about 6.3%, on average, and the path length was reduced by about 4.6%, on average. The experiments also show that the GA-KL method has satisfactory global search capability and can effectively avoid dynamic obstacles. The final experiments also demonstrated that the GA-KL method reduced the total path completion time by an average of 12.6% and the total path length by an average of 8.4% in narrow working environments or highly congested situations, which considerably improved the efficiency of the multi-AGVs.

Cooperative Control of Multiple AGVs Based on Multi-Agent Reinforcement Learning

Learning to Cooperate: Application of Deep Reinforcement Learning for Online AGV Path Finding.

Research on Cooperative Scheduling of AGV Transportation and Charging in Intelligent Warehouse System Based on Dynamic Task Chain

Mapless Collaborative Navigation for a Multi-Robot System Based on the Deep Reinforcement Learning

Decentralized Multi-AGV Task Allocation based on Multi-Agent Reinforcement Learning with Information Potential Field Rewards

Toward Energy-Efficient Routing of Multiple AGVs with Multi-Agent Reinforcement Learning

A Collaborative Control Method of Dual-Arm Robots Based on Deep Reinforcement Learning

Research on Multi-AGVs dynamic scheduling based on deep reinforcement learning

The Design and Realization of Multi-agent Obstacle Avoidance based on Reinforcement Learning

A Self-Attention-Based Deep Reinforcement Learning Approach for AGV Dispatching Systems

Scheduling of AGVs in Automated Container Terminal Based on the Deep Deterministic Policy Gradient (DDPG) Using the Convolutional Neural Network (CNN)

A Multiobjective Reinforcement Learning Approach for AGV Task Clustering

Research on Dynamic Path Planning of Multi-AGVs Based on Reinforcement Learning

Multiple Ships Cooperative Navigation and Collision Avoidance using Multi-agent Reinforcement Learning with Communication

A decentralized path planning model based on deep reinforcement learning

Autonomous and cooperative control of UAV cluster with multi-agent reinforcement learning

Multi-objective Dynamic AGV Scheduling Method Based on Deep Reinforcement Learning

Distributed Drive Autonomous Vehicle Trajectory Tracking Control Based on Multi-Agent Deep Reinforcement Learning

Multi-Objective Optimization of AGV Real-Time Scheduling Based on Deep Reinforcement Learning

Anti-conflict AGV path planning in automated container terminals based on multi-agent reinforcement learning