Abstract:Purpose Most manufacturing plants choose the easy way of completely separating human operators from robots to prevent accidents, but as a result, it dramatically affects the overall quality and speed that is expected from human–robot collaboration. It is not an easy task to ensure human safety when he/she has entered a robot’s workspace, and the unstructured nature of those working environments makes it even harder. The purpose of this paper is to propose a real-time robot collision avoidance method to alleviate this problem. Design/methodology/approach In this paper, a model is trained to learn the direct control commands from the raw depth images through self-supervised reinforcement learning algorithm. To reduce the effect of sample inefficiency and safety during initial training, a virtual reality platform is used to simulate a natural working environment and generate obstacle avoidance data for training. To ensure a smooth transfer to a real robot, the automatic domain randomization technique is used to generate randomly distributed environmental parameters through the obstacle avoidance simulation of virtual robots in the virtual environment, contributing to better performance in the natural environment. Findings The method has been tested in both simulations with a real UR3 robot for several practical applications. The results of this paper indicate that the proposed approach can effectively make the robot safety-aware and learn how to divert its trajectory to avoid accidents with humans within the workspace. Research limitations/implications The method has been tested in both simulations with a real UR3 robot in several practical applications. The results indicate that the proposed approach can effectively make the robot be aware of safety and learn how to change its trajectory to avoid accidents with persons within the workspace. Originality/value This paper provides a novel collision avoidance framework that allows robots to work alongside human operators in unstructured and complex environments. The method uses end-to-end policy training to directly extract the optimal path from the visual inputs for the scene.

Training Is Execution: A Reinforcement Learning-Based Collision Avoidance Algorithm for Volatile Scenarios

Multi-Robot Learning Dynamic Obstacle Avoidance in Formation with Information-Directed Exploration.

Enhanced method for reinforcement learning based dynamic obstacle avoidance by assessment of collision risk

Adaptive Environment Modeling Based Reinforcement Learning for Collision Avoidance in Complex Scenes

Reinforcement Learned Distributed Multi-Robot Navigation With Reciprocal Velocity Obstacle Shaped Rewards

A safe reinforcement learning approach for autonomous navigation of mobile robots in dynamic environments

Real-Time Navigation In Dynamic Human Environments Using Optimal Reciprocal Collision Avoidance

Train Trajectory Optimization with High-Risk State Space Boundaries: A Safe Reinforcement Learning Approach

Reinforcement learning-based collision avoidance: impact of reward function and knowledge transfer

Evolutionary Curriculum Training for DRL-Based Navigation Systems

Robot obstacle avoidance system using deep reinforcement learning

An Efficient Approach for Obstacle Avoidance and Navigation in Robots

Reactive Collision Avoidance for Safe Agile Navigation

A human-like collision avoidance method for USVs based on deep reinforcement learning and velocity obstacle

SAFER: Safe Collision Avoidance using Focused and Efficient Trajectory Search with Reinforcement Learning

Collision Avoidance Among Dense Heterogeneous Agents Using Deep Reinforcement Learning

Learning Configurations of Operating Environment of Autonomous Vehicles to Maximize Their Collisions

Feedback-Based Curriculum Learning for Collision Avoidance

RACE: Reinforced Cooperative Autonomous Vehicle Collision AvoidancE

Deep-Reinforcement-Learning-Based Collision Avoidance of Autonomous Driving System for Vulnerable Road User Safety