Fast-UMI: A Scalable and Hardware-Independent Universal Manipulation Interface

Ziniu Wu,Tianyu Wang,Zhaxizhuoma,Chuyue Guan,Zhongjie Jia,Shuai Liang,Haoming Song,Delin Qu,Dong Wang,Zhigang Wang,Nieqing Cao,Yan Ding,Bin Zhao,Xuelong Li
2024-09-29
Abstract:Collecting real-world manipulation trajectory data involving robotic arms is essential for developing general-purpose action policies in robotic manipulation, yet such data remains scarce. Existing methods face limitations such as high costs, labor intensity, hardware dependencies, and complex setup requirements involving SLAM algorithms. In this work, we introduce Fast-UMI, an interface-mediated manipulation system comprising two key components: a handheld device operated by humans for data collection and a robot-mounted device used during policy inference. Our approach employs a decoupled design compatible with a wide range of grippers while maintaining consistent observation perspectives, allowing models trained on handheld-collected data to be directly applied to real robots. By directly obtaining the end-effector pose using existing commercial hardware products, we eliminate the need for complex SLAM deployment and calibration, streamlining data processing. Fast-UMI provides supporting software tools for efficient robot learning data collection and conversion, facilitating rapid, plug-and-play functionality. This system offers an efficient and user-friendly tool for robotic learning data acquisition.
Robotics
What problem does this paper attempt to address?
The problem this paper attempts to address is: In the field of robotic manipulation, collecting real-world data of robot arms interacting with objects is crucial for developing general action strategies, but such data is currently very scarce. Existing methods have limitations such as high cost, labor intensity, strong hardware dependency, and complex setup requirements (e.g., requiring SLAM algorithms). To overcome these challenges, the paper proposes an interface-mediated manipulation system called Fast-UMI. This system addresses the aforementioned issues in the following ways: 1. **Decoupled Design**: Fast-UMI adopts a decoupled design, making it compatible with various grippers, thereby improving the system's adaptability. 2. **Rapid User Deployment**: The system is designed to be plug-and-play, simplifying the installation and configuration process, allowing users to deploy quickly. 3. **Supporting Software Tools**: It provides efficient tools for robotic learning data collection and transformation, ensuring seamless integration of data acquisition and processing. 4. **Enhanced Scalability**: The system is designed to support multimodal datasets, allowing for the future inclusion of more types of sensors and data types. Through these improvements, Fast-UMI aims to provide an efficient and user-friendly tool for the collection of robotic learning data, thereby promoting the development of general action strategies.