Abstract:Autonomous manipulation in everyday tasks requires flexible action generation to handle complex, diverse real-world environments, such as objects with varying hardness and softness. Imitation Learning (IL) enables robots to learn complex tasks from expert demonstrations. However, a lot of existing methods rely on position/unilateral control, leaving challenges in tasks that require force information/control, like carefully grasping fragile or varying-hardness objects. As the need for diverse controls increases, there are demand for low-cost bimanual robots that consider various motor inputs. To address these challenges, we introduce Bilateral Control-Based Imitation Learning via Action Chunking with Transformers(Bi-ACT) and"A" "L"ow-cost "P"hysical "Ha"rdware Considering Diverse Motor Control Modes for Research in Everyday Bimanual Robotic Manipulation (ALPHA-$\alpha$). Bi-ACT leverages bilateral control to utilize both position and force information, enhancing the robot's adaptability to object characteristics such as hardness, shape, and weight. The concept of ALPHA-$\alpha$ is affordability, ease of use, repairability, ease of assembly, and diverse control modes (position, velocity, torque), allowing researchers/developers to freely build control systems using ALPHA-$\alpha$. In our experiments, we conducted a detailed analysis of Bi-ACT in unimanual manipulation tasks, confirming its superior performance and adaptability compared to Bi-ACT without force control. Based on these results, we applied Bi-ACT to bimanual manipulation tasks. Experimental results demonstrated high success rates in coordinated bimanual operations across multiple tasks. The effectiveness of the Bi-ACT and ALPHA-$\alpha$ can be seen through comprehensive real-world experiments. Video available at: <a class="link-external link-https" href="https://mertcookimg.github.io/alpha-biact/" rel="external noopener nofollow">this https URL</a>

Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware

Learning Robot Manipulation Skills from Human Demonstration Videos Using Two-Stream 2-D/3-D Residual Networks with Self-Attention

ALOHA Unleashed: A Simple Recipe for Robot Dexterity

Learning Variable Compliance Control From a Few Demonstrations for Bimanual Robot with Haptic Feedback Teleoperation System

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning

Haptic-ACT: Bridging Human Intuition with Compliant Robotic Manipulation via Immersive VR

A System for Imitation Learning of Contact-Rich Bimanual Manipulation Policies

Vision-Based Multi-Task Manipulation for Inexpensive Robots Using End-To-End Learning from Demonstration

InterACT: Inter-dependency Aware Action Chunking with Hierarchical Attention Transformers for Bimanual Manipulation

ALPHA-$α$ and Bi-ACT Are All You Need: Importance of Position and Force Information/Control for Imitation Learning of Unimanual and Bimanual Robotic Manipulation with Low-Cost System

Dexterous Manipulation with Deep Reinforcement Learning: Efficient, General, and Low-Cost

VITAL: Visual Teleoperation to Enhance Robot Learning through Human-in-the-Loop Corrections

Learning to Manipulate Tools by Aligning Simulation to Video Demonstration

Human-Agent Joint Learning for Efficient Robot Manipulation Skill Acquisition

Efficient Robot Skill Learning with Imitation from a Single Video for Contact-Rich Fabric Manipulation

MimicTouch: Leveraging Multi-modal Human Tactile Demonstrations for Contact-rich Manipulation

Dexterous Imitation Made Easy: A Learning-Based Framework for Efficient Dexterous Manipulation

Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance

Giving Robots a Hand: Learning Generalizable Manipulation with Eye-in-Hand Human Video Demonstrations

Learning Fine Pinch-Grasp Skills using Tactile Sensing from A Few Real-world Demonstrations