Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed

Yifan Wang,Xingyi He,Sida Peng,Dongli Tan,Xiaowei Zhou
DOI: https://doi.org/10.1109/cvpr52733.2024.02047
2024-01-01
Computer Vision and Pattern Recognition
Abstract:We present a novel method for efficiently producing semi-dense matches acrossimages. Previous detector-free matcher LoFTR has shown remarkable matchingcapability in handling large-viewpoint change and texture-poor scenarios butsuffers from low efficiency. We revisit its design choices and derive multipleimprovements for both efficiency and accuracy. One key observation is thatperforming the transformer over the entire feature map is redundant due toshared local information, therefore we propose an aggregated attentionmechanism with adaptive token selection for efficiency. Furthermore, we findspatial variance exists in LoFTR's fine correlation module, which is adverse tomatching accuracy. A novel two-stage correlation layer is proposed to achieveaccurate subpixel correspondences for accuracy improvement. Our efficiencyoptimized model is ∼ 2.5× faster than LoFTR which can even surpassstate-of-the-art efficient sparse matching pipeline SuperPoint + LightGlue.Moreover, extensive experiments show that our method can achieve higheraccuracy compared with competitive semi-dense matchers, with considerableefficiency benefits. This opens up exciting prospects for large-scale orlatency-sensitive applications such as image retrieval and 3D reconstruction.Project page: https://zju3dv.github.io/efficientloftr.
What problem does this paper attempt to address?