Abstract:Collision avoidance is a crucial technique to achieve safe and efficient robotic vehicle navigation in unknown environments. However, moving obstacles with unpredictability in dynamic scenarios, usually increase the difficulty and complexity in collision avoidance of robotic vehicles. To enhance the stability of collision avoidance and boost its adaptability to uncertain dynamic scenes, a new attention-based value classification actor-critic (AVCAC) architecture is proposed. It is an end-to-end robot navigation model that utilizes imperfect local observation to directly plan accurate collision-free motion commands. First, we design a value-classified rollout replaybuffer to categorize the experiences into different pools. It can prevent any overfitting or bias that may result from repeatedly sampling experiences of a certain type during policy learning. Then, we improve the conventional actor-critic network with a multi-head local attention module to extract the local observations at entity-level. This way, the collision avoidance system can focus on key environmental features to operate more efficiently and respond more swiftly to dynamic changes in the environment. Moreover, a lookahead multi-step prediction (LMP) reward setting is devised in the AVCAC-based reinforcement learning (RL) framework to facilitate more informed and forward-looking decision-making. Finally, the policy entropy (PE) and policy delay (PD) are extended to AVCAC model to enhance policy exploration and make policy more robust. Extensive experimental results reveal that our method can generate time-efficient and collision-free guide paths to dodge collisions under complex dynamic environments.

Navigation Command Matching for Vision-based Autonomous Driving

Stochastic Navigation Command Matching for Imitation Learning of a Driving Policy.

Multigoal Visual Navigation With Collision Avoidance via Deep Reinforcement Learning

Visual Navigation with Multiple Goals Based on Deep Reinforcement Learning

Vision-Language Navigation Policy Learning and Adaptation

BEVNav: Robot Autonomous Navigation Via Spatial-Temporal Contrastive Learning in Bird's-Eye View

Efficient Multi-agent Cooperative Navigation in Unknown Environments with Interlaced Deep Reinforcement Learning.

Learning Reward Function with Matching Network for Mapless Navigation

Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics

Reinforcement Learning-Based Visual Navigation With Information-Theoretic Regularization

Exploring the Task Cooperation in Multi-goal Visual Navigation.

Navigation for Autonomous Vehicles Via Fast-Stable and Smooth Reinforcement Learning

NAVS: A Neural Attention-Based Visual SLAM for Autonomous Navigation in Unknown 3D Environments

Probabilistic End-to-End Vehicle Navigation in Complex Dynamic Environments with Multimodal Sensor Fusion

Autonomous navigation of mobile robots in unknown environments using off-policy reinforcement learning with curriculum learning

DeepGoal: Learning to drive with driving intention from human control demonstration

ReinforcementDriving: Exploring Trajectories and Navigation for Autonomous Vehicles

Memory-based soft actor–critic with prioritized experience replay for autonomous navigation

DMCL: Robot Autonomous Navigation Via Depth Image Masked Contrastive Learning

Attention-based Value Classification Reinforcement Learning for Collision-free Robot Navigation

Efficient Reinforcement Learning for 3D LiDAR Navigation of Mobile Robot