Vision-based Tactile Image Generation via Contact Condition-guided Diffusion Model

Xi Lin,Weiliang Xu,Yixian Mao,Jing Wang,Meixuan Lv,Lu Liu,Xihui Luo,Xinming Li
2024-12-02
Abstract:Vision-based tactile sensors, through high-resolution optical measurements, can effectively perceive the geometric shape of objects and the force information during the contact process, thus helping robots acquire higher-dimensional tactile data. Vision-based tactile sensor simulation supports the acquisition and understanding of tactile information without physical sensors by accurately capturing and analyzing contact behavior and physical properties. However, the complexity of contact dynamics and lighting modeling limits the accurate reproduction of real sensor responses in simulations, making it difficult to meet the needs of different sensor setups and affecting the reliability and effectiveness of strategy transfer to practical applications. In this letter, we propose a contact-condition guided diffusion model that maps RGB images of objects and contact force data to high-fidelity, detail-rich vision-based tactile sensor images. Evaluations show that the three-channel tactile images generated by this method achieve a 60.58% reduction in mean squared error and a 38.1% reduction in marker displacement error compared to existing approaches based on lighting model and mechanical model, validating the effectiveness of our approach. The method is successfully applied to various types of tactile vision sensors and can effectively generate corresponding tactile images under complex loads. Additionally, it demonstrates outstanding reconstruction of fine texture features of objects in a Montessori tactile board texture generation task.
Robotics
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to generate tactile images with higher fidelity more accurately in visual - tactile sensor simulation, so as to reduce the gap with practical applications. Specifically, when dealing with the complexity of contact dynamics and illumination modeling, the existing visual - tactile sensor simulation methods are difficult to accurately reproduce the responses of real sensors, which limits their applicability in different sensor settings and affects the reliability and effectiveness of strategies transferred to practical applications. For this reason, the author proposes a method based on the contact - condition - guided diffusion model. This method can generate high - detail, high - quality visual - tactile sensor images from the RGB images of objects and contact force data, thereby enhancing the realism of simulation and the effectiveness of applications. The main contributions of the paper include: - Proposing a new contact - condition - guided diffusion model method for pixel - level data mapping between different data domains to learn the optical environment of the sensor and the deformation movement of the elastomer. The mean - square error (MSE) of the RGB - illumination tactile images generated by this method is 62.97% lower than that of the existing methods based on illumination models and mechanical models. - This method is applicable to various customized visual - tactile sensors, such as photometric stereo vision and marker - based systems. Under different contact loads, this method outperforms other tactile image simulators in the marker displacement error index, achieving a 55.61% reduction in marker displacement error. - This model is applied to the texture generation task of the Montessori tactile board, demonstrating its efficiency in accurately restoring subtle texture features.