Unsupervised Deep Homography Estimation Based on Transformer.

Tianjian Jiang,Qiu Fang,Qing Zhu,Yaonan Wang,Zhen Zhou,Lin Chen,Jiaming Zhou,Yuefan Luo,Chengzhong Wu
DOI: https://doi.org/10.1109/icarm58088.2023.10218971
2023-01-01
Abstract:Homography estimation is a crucial problem in computer vision, which aims to provide an optimal transformation matrix for aligning images captured from different viewpoints. Current methods extract shallow features from image pairs and introduce learnable mask modules to improve homography estimation performance. However, they struggle to capture long-term dependencies between features and comprehend the global structures of image features. A deep unsupervised homography learning framework is proposed in this paper, consisting of a weight-sharing feature extraction network and a homography estimation network based on the Transformer model. The former extracts the local features of images, while the latter learns the correlation between them and understands the global features of images, enabling the algorithm to better estimate the homography of unaligned images. Experimental results demonstrate that the proposed method outperforms the advanced methods for estimating homography matrices in the CA-Unsupervised dataset.
What problem does this paper attempt to address?