Efficient and Accurate Co-Visible Region Localization with Matching Key-Points Crop (MKPC): A Two-Stage Pipeline for Enhancing Image Matching Performance

Hongjian Song,Yuki Kashiwaba,Shuai Wu,Canming Wang
2023-03-24
Abstract:Image matching is a classic and fundamental task in computer vision. In this paper, under the hypothesis that the areas outside the co-visible regions carry little information, we propose a matching key-points crop (MKPC) algorithm. The MKPC locates, proposes and crops the critical regions, which are the co-visible areas with great efficiency and accuracy. Furthermore, building upon MKPC, we propose a general two-stage pipeline for image matching, which is compatible to any image matching models or combinations. We experimented with plugging SuperPoint + SuperGlue into the two-stage pipeline, whose results show that our method enhances the performance for outdoor pose estimations. What's more, in a fair comparative condition, our method outperforms the SOTA on Image Matching Challenge 2022 Benchmark, which represents the hardest outdoor benchmark of image matching currently.
Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
The paper attempts to address the problem of improving matching performance in image matching tasks in computer vision. Specifically, the authors propose a hypothesis: the non-overlapping regions outside the two images contain very little information. Based on this hypothesis, the authors propose an algorithm called Matching Key-Points Crop (MKPC) to efficiently and accurately crop out the key regions (i.e., overlapping regions) in the two images. Additionally, the authors design a two-stage pipeline framework that can be compatible with any image matching model and significantly enhance the effectiveness of image matching. Through experimental validation, this method demonstrated stable performance improvement in the outdoor pose estimation task on the PhotoTourism dataset (a subset of the YFCC100M dataset). Particularly, in the Image Matching Challenge 2022 benchmark, this method surpassed the existing state-of-the-art (SOTA) models, indicating its great potential and advantage when integrating multiple models. Although the method is also effective in indoor scenes (such as the ScanNet dataset), it requires higher computational complexity. Overall, the paper demonstrates the effectiveness and generalization capability of this method.