Visuo-Tactile Keypoint Correspondences for Object Manipulation

Jeong-Jung Kim, Doo-Yeol Koh, Chang-Hyun Kim
2024-05-23
Abstract: This paper presents a novel manipulation strategy that uses keypoint correspondences extracted from visuo-tactile sensor images to facilitate precise object manipulation. Our approach uses the visuo-tactile feedback to guide the robot's actions for accurate object grasping and placement, eliminating the need for post-grasp adjustments and extensive training. This method provides an improvement in deployment efficiency, addressing the challenges of manipulation tasks in environments where object locations are not predefined. We validate the effectiveness of our strategy through experiments demonstrating the extraction of keypoint correspondences and their application to real-world tasks such as block alignment and gear insertion, which require millimeter-level precision. The results show an average error margin significantly lower than that of traditional vision-based methods, which is sufficient to achieve the target tasks.
Robotics
What problem does this paper attempt to address?
This paper addresses how to achieve precise object grasping and placement in robotic manipulation, especially in environments where object positions are not predefined. In such settings, traditional vision-based methods suffer from sensor noise and environmental interference, which degrade manipulation accuracy. The paper therefore proposes a manipulation strategy that uses keypoint correspondences extracted from visuo-tactile sensor images to guide the robot's actions. This improves manipulation precision, reduces the need for post-grasp adjustment, and lowers the dependence on extensive training, which in turn improves deployment efficiency. Specifically, the paper makes the following contributions:

1. **Method innovation**: It proposes a method that achieves precise manipulation without additional learning by using keypoint correspondences in visuo-tactile sensor data.
2. **Practical application verification**: It demonstrates experimentally the feasibility and reliability of the method on practical tasks that require millimeter-level precision, such as block alignment and gear insertion.

The experiments indicate that keypoint correspondences can be accurately extracted from visuo-tactile images, and that the average position error is low enough to enable precise manipulation when combined with techniques such as impedance control. Tasks at this level of precision are very challenging for traditional vision-based systems.
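
The summary does not say how matched keypoints are turned into a motion correction, so the following is only an illustrative sketch, not the authors' method. It assumes the grasped object moves as a planar rigid body in the tactile image, that keypoints have already been matched and converted to millimeters, and it uses a hypothetical helper name, `estimate_rigid_2d`. Under those assumptions, the least-squares rotation and translation between a reference tactile image and the current one could be recovered like this:

```python
import math

def estimate_rigid_2d(src, dst):
    """Least-squares planar rotation (rad) and translation mapping src onto dst.

    src, dst: equal-length lists of matched (x, y) keypoints.
    Hypothetical helper for illustration; not taken from the paper.
    """
    if len(src) != len(dst) or len(src) < 2:
        raise ValueError("need at least two matched keypoint pairs")
    n = len(src)
    # Centroids of both keypoint sets.
    sx = sum(p[0] for p in src) / n
    sy = sum(p[1] for p in src) / n
    dx = sum(p[0] for p in dst) / n
    dy = sum(p[1] for p in dst) / n
    # Accumulate dot and cross products of the centered point pairs.
    dot = 0.0
    cross = 0.0
    for (ax, ay), (bx, by) in zip(src, dst):
        ax, ay, bx, by = ax - sx, ay - sy, bx - dx, by - dy
        dot += ax * bx + ay * by
        cross += ax * by - ay * bx
    # Optimal rotation angle for the 2D least-squares alignment.
    theta = math.atan2(cross, dot)
    c, s = math.cos(theta), math.sin(theta)
    # Translation that maps the rotated source centroid onto the target centroid.
    tx = dx - (c * sx - s * sy)
    ty = dy - (s * sx + c * sy)
    return theta, (tx, ty)

# Made-up example data: a reference keypoint set and the same set after a
# 10 degree rotation and a (1.5, -0.5) mm shift.
reference = [(0.0, 0.0), (4.0, 0.0), (4.0, 2.0), (0.0, 2.0)]
angle = math.radians(10.0)
current = [
    (math.cos(angle) * x - math.sin(angle) * y + 1.5,
     math.sin(angle) * x + math.cos(angle) * y - 0.5)
    for x, y in reference
]
theta, (tx, ty) = estimate_rigid_2d(reference, current)
print(math.degrees(theta), tx, ty)
```

In a real system the estimated offset would presumably be passed to the placement controller (for example as a pose correction before an impedance-controlled insertion), and outlier matches would need to be rejected first; both steps are outside what this sketch covers.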