Abstract:Purpose Most manufacturing plants choose the easy way of completely separating human operators from robots to prevent accidents, but as a result, it dramatically affects the overall quality and speed that is expected from human–robot collaboration. It is not an easy task to ensure human safety when he/she has entered a robot’s workspace, and the unstructured nature of those working environments makes it even harder. The purpose of this paper is to propose a real-time robot collision avoidance method to alleviate this problem. Design/methodology/approach In this paper, a model is trained to learn the direct control commands from the raw depth images through self-supervised reinforcement learning algorithm. To reduce the effect of sample inefficiency and safety during initial training, a virtual reality platform is used to simulate a natural working environment and generate obstacle avoidance data for training. To ensure a smooth transfer to a real robot, the automatic domain randomization technique is used to generate randomly distributed environmental parameters through the obstacle avoidance simulation of virtual robots in the virtual environment, contributing to better performance in the natural environment. Findings The method has been tested in both simulations with a real UR3 robot for several practical applications. The results of this paper indicate that the proposed approach can effectively make the robot safety-aware and learn how to divert its trajectory to avoid accidents with humans within the workspace. Research limitations/implications The method has been tested in both simulations with a real UR3 robot in several practical applications. The results indicate that the proposed approach can effectively make the robot be aware of safety and learn how to change its trajectory to avoid accidents with persons within the workspace. Originality/value This paper provides a novel collision avoidance framework that allows robots to work alongside human operators in unstructured and complex environments. The method uses end-to-end policy training to directly extract the optimal path from the visual inputs for the scene.

Autonomous Boundary of Human-Machine Collaboration System Based on Reinforcement Learning

Human-machine Shared Autonomy Approach for Non-Full-time Effective Human Decisions

Emergence of Human-comparable Balancing Behaviors by Deep Reinforcement Learning

Look Before You Leap: Safe Model-Based Reinforcement Learning with Human Intervention

A graph-based reinforcement learning-enabled approach for adaptive human-robot collaborative assembly operations

Reinforcement Learning for Human-Robot Shared Control

Shared Autonomy Based on Human-in-the-loop Reinforcement Learning with Policy Constraints

Hybrid Autonomous Controller for Bipedal Robot Balance with Deep Reinforcement Learning and Pattern Generators

A Learning Based Hierarchical Control Framework for Human-Robot Collaboration

Human-AI Collaboration in Real-World Complex Environment with Reinforcement Learning

A reinforcement learning method for human-robot collaboration in assembly tasks

Robot obstacle avoidance system using deep reinforcement learning

D-HAL: Distributed Hierarchical Adversarial Learning for Multi-Agent Interaction in Autonomous Intersection Management

An Efficient and Responsive Robot Motion Controller for Safe Human-Robot Collaboration

Human-in-the-Loop Deep Reinforcement Learning with Application to Autonomous Driving

Traded Control of Human–Machine Systems for Sequential Decision-Making Based on Reinforcement Learning

Application of Hybrid Deep Reinforcement Learning for Managing Connected Cars at Pedestrian Crossings: Challenges and Research Directions

Enhancing Socially-Aware Robot Navigation through Bidirectional Natural Language Conversation

Unified Human-Robot-Environment Interaction Control in Contact-Rich Collaborative Manipulation Tasks Via Model-Based Reinforcement Learning

Model-Based Reinforcement Learning Variable Impedance Control for Human-Robot Collaboration

A Learning-Based Framework for Safe Human-Robot Collaboration with Multiple Backup Control Barrier Functions