Abstract: Applying reinforcement learning to high-precision decision-making in robot arm assembly is a sought-after goal in the industrial community. However, tasks such as Flexible Flat Cable (FFC) assembly, which require highly trained workers, pose significant challenges because of sparse rewards and limited learning conditions. In this work, we propose a goal-conditioned self-imitation reinforcement learning method for FFC assembly that does not rely on a specific end-effector and in which both perception and behavior planning are learned through reinforcement learning. We analyze the challenges a robot arm faces in high-precision assembly scenarios and balance the breadth and depth of exploration during training. Our end-to-end model consists of hindsight and self-imitation modules, allowing the robot arm to make use of otherwise futile exploration and to optimize successful trajectories. Our method requires no rule-based or manually designed rewards. Through experience relabeling, it enables the robot arm to find feasible solutions quickly while avoiding unnecessary exploration. We train the FFC assembly policy in a simulation environment and transfer it to the real scenario using domain adaptation. We explore various combinations of hindsight and self-imitation learning and discuss the results comprehensively. Experimental findings demonstrate that our model achieves fast and advanced FFC assembly, surpassing other reinforcement learning-based methods.

Note to Practitioners: The motivation of this article stems from the need to develop an efficient and accurate FFC assembly policy for the 3C (Computer, Communication, and Consumer Electronics) industry, promoting the development of intelligent manufacturing. Traditional control methods cannot complete such a high-precision task with a robot arm because the connectors are difficult to model, and existing reinforcement learning methods fail to converge within a restricted number of epochs because the goals or trajectories are difficult. To quickly learn a high-quality assembly policy for the robot arm and accelerate convergence, we combine goal-conditioned reinforcement learning with a self-imitation mechanism, balancing the depth and breadth of exploration. The proposed method takes visual information and six-dimensional force as the state and obtains satisfactory assembly policies. We build a simulation scene on the PyBullet platform and pre-train the robot arm in it. The pre-trained policies can then be reused in real scenarios with fine-tuning.
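The experience relabeling mentioned in the abstract can be illustrated with a generic hindsight-style sketch. This is not the authors' implementation: it assumes the common "final goal" relabeling strategy with a sparse 0/-1 reward, and every name in it (`Transition`, `sparse_reward`, `relabel_with_final_goal`, the tolerance `tol`) is hypothetical. It shows how a failed episode can be turned into a successful one for a substituted goal, and how successful trajectories could then be kept as candidates for self-imitation.

```python
from dataclasses import dataclass, replace
from typing import List, Tuple

Vector = Tuple[float, ...]


@dataclass(frozen=True)
class Transition:
    """One step of a goal-conditioned episode (hypothetical layout)."""
    state: Vector
    action: Vector
    achieved_goal: Vector  # what the arm actually reached after the action
    desired_goal: Vector   # what the policy was asked to reach
    reward: float


def sparse_reward(achieved: Vector, desired: Vector, tol: float = 1e-3) -> float:
    """Assumed sparse reward: 0 when the goal is reached within tol, else -1."""
    dist = max(abs(a - d) for a, d in zip(achieved, desired))
    return 0.0 if dist <= tol else -1.0


def relabel_with_final_goal(episode: List[Transition]) -> List[Transition]:
    """Pretend the last achieved goal was the desired goal all along."""
    if not episode:
        return []
    new_goal = episode[-1].achieved_goal
    return [
        replace(
            t,
            desired_goal=new_goal,
            reward=sparse_reward(t.achieved_goal, new_goal),
        )
        for t in episode
    ]


def is_successful(episode: List[Transition]) -> bool:
    """An episode counts as successful if its final step earns reward 0."""
    return bool(episode) and episode[-1].reward == 0.0


def select_for_imitation(episodes: List[List[Transition]]) -> List[List[Transition]]:
    """Keep only successful trajectories as self-imitation candidates."""
    return [ep for ep in episodes if is_successful(ep)]


# A failed one-dimensional episode: the goal was 1.0, the arm stopped at 0.8.
failed_episode = [
    Transition((0.0,), (0.5,), (0.5,), (1.0,), -1.0),
    Transition((0.5,), (0.3,), (0.8,), (1.0,), -1.0),
]
relabeled_episode = relabel_with_final_goal(failed_episode)
imitation_buffer = select_for_imitation([failed_episode, relabeled_episode])
```

In this sketch only the relabeled copy ends up in `imitation_buffer`, since the original episode never reaches its goal. How the actual method weights relabeled against genuinely successful trajectories is not specified in the abstract.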

Task Attention-Based Multimodal Fusion and Curriculum Residual Learning for Context Generalization in Robotic Assembly

An Attention-Based Deep Learning Approach for Inertial Motion Recognition and Estimation in Human-Robot Collaboration

A Residual Reinforcement Learning Method for Robotic Assembly Using Visual and Force Information

Hand-in-Hand Guidance: an Explore-Exploit Based Reinforcement Learning Method for Performance Driven Assembly-Adjustment

Vision-force-fused curriculum learning for robotic contact-rich assembly tasks

CLFR-M: Continual Learning Framework for Robots Via Human Feedback and Dynamic Memory

Towards Generalization and Data Efficient Learning of Deep Robotic Grasping

Multi-Modal Fusion in Contact-Rich Precise Tasks via Hierarchical Policy Learning

Extended residual learning with one-shot imitation learning for robotic assembly in semi-structured environment

Using Goal-Conditioned Reinforcement Learning with Deep Imitation to Control Robot Arm in Flexible Flat Cable Assembly Task

AssemblyComplete: 3D Combinatorial Construction with Deep Reinforcement Learning

One-shot sim-to-real transfer policy for robotic assembly via reinforcement learning with visual demonstration

Hierarchical Hybrid Learning for Long-Horizon Contact-Rich Robotic Assembly

Deep reinforcement learning on variable stiffness compliant control for programming-free robotic assembly in smart manufacturing

Accelerating Robot Learning of Contact-Rich Manipulations: A Curriculum Learning Study

Mastering Autonomous Assembly in Fusion Application with Learning-by-doing: a Peg-in-hole Study

Assembly task allocation of human-robot collaboration based on deep reinforcement learning

Digital-Twin-Assisted Skill Learning for 3C Assembly Tasks

Model Accelerated Reinforcement Learning for High Precision Robotic Assembly

Multimodality Driven Impedance-Based Sim2Real Transfer Learning for Robotic Multiple Peg-in-Hole Assembly

Human-robot collaborative assembly task planning for mobile cobots based on deep reinforcement learning