Abstract: The development and deployment of robots on construction sites are integral to the industrialization of construction, known as Construction 4.0. Tele-operated and pre-programmed robots have enhanced construction efficiency and safety. However, their on-site use remains limited because they require expert remote control and adapt poorly to dynamic environments. Reinforcement learning (RL) has emerged as a promising solution, as RL-controlled robots can learn on their own to adapt to diverse situations. Nevertheless, manually designing RL reward functions for complex tasks is challenging. To address this issue, inverse reinforcement learning (IRL) methods, such as Generative Adversarial Imitation Learning (GAIL), have been proposed to learn optimal actions through expert demonstration and self-exploration, without explicitly defined reward functions. In this study, we propose an approach that integrates GAIL with virtual reality (VR) based robot control to handle long-horizon collaborative construction tasks (i.e., long sequences of several sub-tasks performed by multiple robots). We employ VR expert demonstrations as input for GAIL training, enabling a team of robots, including an Unmanned Ground Vehicle (UGV) and two robot arms, to interact with the designed RL environment and perform tasks such as transporting, picking, and installing window panels. For evaluation, we compare the performance of our VR-GAIL model with a prevalent and robust RL baseline, Proximal Policy Optimization (PPO). The results demonstrate that our reward-free VR-GAIL model achieves, on average, a 4.5% higher success rate than its PPO counterpart equipped with carefully designed reward functions across all three sub-tasks and their randomized variations. Furthermore, the performance gap between GAIL and PPO widens as task difficulty increases. These findings indicate that our approach effectively enhances RL agent performance on complex construction tasks while expediting development by eliminating the need for reward function design.
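To make the "reward-free" idea concrete, the following is a minimal sketch of the discriminator side of GAIL, under stated assumptions. The abstract does not specify the network architecture, the observation and action features, or the exact reward form, so everything here is illustrative: `LinearDiscriminator` is a hypothetical logistic-regression stand-in for a neural discriminator, the Gaussian samples stand in for VR demonstrations and policy rollouts, and the surrogate reward uses the `-log(1 - D)` form, which is one common convention. The policy update that would consume this reward is omitted.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


class LinearDiscriminator:
    """Hypothetical stand-in: D(s, a) = estimated probability that a sample is from the expert."""

    def __init__(self, dim, lr=0.1):
        self.w = np.zeros(dim)
        self.b = 0.0
        self.lr = lr

    def prob_expert(self, x):
        return sigmoid(x @ self.w + self.b)

    def update(self, expert_x, policy_x):
        # One gradient step that lowers the binary cross-entropy loss:
        # expert samples are labelled 1, policy samples are labelled 0.
        x = np.vstack([expert_x, policy_x])
        y = np.concatenate([np.ones(len(expert_x)), np.zeros(len(policy_x))])
        err = y - self.prob_expert(x)
        self.w += self.lr * (x.T @ err) / len(x)
        self.b += self.lr * err.mean()

    def surrogate_reward(self, x, eps=1e-8):
        # Learned reward passed to the policy optimiser in place of a hand-designed one.
        return -np.log(1.0 - self.prob_expert(x) + eps)


# Synthetic (state, action) feature vectors; assumptions for illustration only.
rng = np.random.default_rng(0)
expert_x = rng.normal(loc=1.0, scale=0.5, size=(256, 4))   # stand-in for VR demonstrations
policy_x = rng.normal(loc=-1.0, scale=0.5, size=(256, 4))  # stand-in for policy rollouts

disc = LinearDiscriminator(dim=4)
for _ in range(200):
    disc.update(expert_x, policy_x)

expert_reward = disc.surrogate_reward(expert_x).mean()
policy_reward = disc.surrogate_reward(policy_x).mean()
```

In a full training loop the discriminator update would alternate with a policy-gradient update on `surrogate_reward`, so that the policy's rollouts gradually become harder to tell apart from the demonstrations.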

Enhancing construction robot learning for collaborative and long-horizon tasks using generative adversarial imitation learning
