Low Fidelity Visuo-Tactile Pretraining Improves Vision-Only Manipulation Performance

Selam Gano,Abraham George,Amir Barati Farimani
2024-10-03
Abstract:Tactile perception is a critical component of solving real-world manipulation tasks, but tactile sensors for manipulation have barriers to use such as fragility and cost. In this work, we engage a robust, low-cost tactile sensor, BeadSight, as an alternative to precise pre-calibrated sensors for a pretraining approach to manipulation. We show that tactile pretraining, even with a low-fidelity sensor as BeadSight, can improve an imitation learning agent's performance on complex manipulation tasks. We demonstrate this method against a baseline USB cable plugging task, previously achieved with a much higher precision GelSight sensor as the tactile input to pretraining. Our best BeadSight pretrained visuo-tactile agent completed the task with 70\% accuracy compared to 85\% for the best GelSight pretrained visuo-tactile agent, with vision-only inference for both.
Robotics
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to use low - cost, low - precision tactile sensors (such as BeadSight) for vision - tactile pre - training in robotic manipulation tasks, in order to improve the performance of imitation - learning agents that use only visual information in complex manipulation tasks. Specifically, the paper explores whether, even with a low - precision tactile sensor, vision - tactile pre - training can improve the performance of agents that rely solely on visual information when performing complex manipulation tasks, especially in tasks such as plugging and unplugging USB cables. In addition, the paper also investigates how to reduce the over - fitting problem by freezing the pre - trained tactile encoder weights, thereby further enhancing the performance of the agent.