Abstract:Single-resolution homography estimation of infrared and visible images is a significant and challenging research area within the field of computing, which has attracted a great deal of attention. However, due to the large modal differences between infrared and visible images, existing methods are difficult to stably and accurately extract and match features between the two image types at a single resolution, which results in poor performance on the homography estimation task. To address this issue, this paper proposes an end-to-end unsupervised single-resolution infrared and visible image homography estimation method based on graph neural network (GNN), homoViG. Firstly, the method employs a triple attention shallow feature extractor to capture cross-dimensional feature dependencies and enhance feature representation effectively. Secondly, Vision GNN (ViG) is utilized as the backbone network to transform the feature point matching problem into a graph node matching problem. Finally, this paper proposes a new homography estimator, residual fusion vision graph neural network (RFViG), to reduce the feature redundancy caused by the frequent residual operations of ViG. Meanwhile, RFViG replaces the residual connections with an attention feature fusion module, highlighting the important features in the low-level feature graph. Furthermore, this model introduces detail feature loss and feature identity loss in the optimization phase, facilitating network optimization. Through extensive experimentation, we demonstrate the efficacy of all proposed components. The experimental results demonstrate that homoViG outperforms existing methods on synthetic benchmark datasets in both qualitative and quantitative comparisons.

Unsupervised Deep Homography Estimation Based on Transformer.

Unsupervised deep homography with multi-scale global attention.

Deep Homography Estimation With Feature Correlation Transformer

Content-Aware Unsupervised Deep Homography Estimation and its Extensions

Recurrent Homography Estimation Using Homography-Guided Image Warping and Focus Transformer

Unsupervised Deep Homography: A Fast and Robust Homography Estimation Model

A Depth Estimation Framework Based on Unsupervised Learning and Cross-Modal Translation

AbHE: All Attention-Based Homography Estimation

RTHEN: Unsupervised deep homography estimation based on dynamic attention for repetitive texture image stitching

Self-Supervised Deep Homography Estimation with Invertibility Constraints

STN-Homography: Direct Estimation of Homography Parameters for Image Pairs

Deep Homography Estimation with Pairwise Invertibility Constraint.

Image stitching via deep homography estimation

LocalTrans: A Multiscale Local Transformer Network for Cross-Resolution Homography Estimation

Deep Unsupervised Homography Estimation for Single-Resolution Infrared and Visible Images Using GNN

InterNet: Unsupervised Cross-modal Homography Estimation Based on Interleaved Modality Transfer and Self-supervised Homography Prediction

Coarse-to-Fine Homography Estimation for Infrared and Visible Images

Bilevel progressive homography estimation via correlative region-focused transformer

Iterative Deep Homography Estimation

76‐3: A Modified Unsupervised Vision Transformer Network for High‐fidelity Computer‐generated Holography

CrossHomo: Cross-Modality and Cross-Resolution Homography Estimation