DECAF: a Discrete-Event based Collaborative Human-Robot Framework for Furniture Assembly

Giulio Giacomuzzo,Matteo Terreran,Siddarth Jain,Diego Romeres
2024-08-29
Abstract:This paper proposes a task planning framework for collaborative Human-Robot scenarios, specifically focused on assembling complex systems such as furniture. The human is characterized as an uncontrollable agent, implying for example that the agent is not bound by a pre-established sequence of actions and instead acts according to its own preferences. Meanwhile, the task planner computes reactively the optimal actions for the collaborative robot to efficiently complete the entire assembly task in the least time possible. We formalize the problem as a Discrete Event Markov Decision Problem (DE-MDP), a comprehensive framework that incorporates a variety of asynchronous behaviors, human change of mind and failure recovery as stochastic events. Although the problem could theoretically be addressed by constructing a graph of all possible actions, such an approach would be constrained by computational limitations. The proposed formulation offers an alternative solution utilizing Reinforcement Learning to derive an optimal policy for the robot. Experiments where conducted both in simulation and on a real system with human subjects assembling a chair in collaboration with a 7-DoF manipulator.
Robotics
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to effectively plan the actions of robots to achieve efficient collaboration with human operators in human - robot collaboration scenarios, especially in complex tasks such as furniture assembly. Specifically, the paper focuses on how to calculate the optimal action plan for the robot when collaborating between an uncontrollable human operator (that is, the behavior of the human operator is not restricted by a preset sequence but acts according to his or her own preferences) and the robot, so as to complete the entire assembly task in the shortest time. The method proposed in the paper is to formalize the problem as a Discrete Event Markov Decision Problem (DE - MDP), which is a comprehensive framework that can comprehensively consider multiple asynchronous behaviors, human change of mind, and random events such as fault recovery. Since directly constructing a graph of all possible actions may be affected by computational limitations, the paper proposes an alternative solution of using Reinforcement Learning (RL) to derive the optimal strategy for the robot. Through experiments in the simulation environment and the real system, the paper verifies the effectiveness of the proposed framework. In the experiment, human participants collaborate with a 7 - degree - of - freedom robotic arm to assemble chairs. The experimental results show that, compared with humans completing the task alone, using the DECAF framework can reduce the completion time, reduce physical exertion, and improve the assembly experience of human operators.