Abstract:Feature based image matching has been a research focus in photogrammetry and computer vision for decades, as it is the basis for many applications where multi-view geometry is needed. A typical feature based image matching algorithm contains five steps: feature detection, affine shape estimation, orientation assignment, description and descriptor matching. This paper contains innovative work in different steps of feature matching based on convolutional neural networks (CNN). For the affine shape estimation and orientation assignment, the main contribution of this paper is twofold. First, we define a canonical shape and orientation for each feature. As a consequence, instead of the usual Siamese CNN, only single branch CNNs needs to be employed to learn the affine shape and orientation parameters, which turns the related tasks from supervised to self supervised learning problems, removing the need for known matching relationships between features. Second, the affine shape and orientation are solved simultaneously. To the best of our knowledge, this is the first time these two modules are reported to have been successfully trained together. In addition, for the descriptor learning part, a new weak match finder is suggested to better explore the intra-variance of the appearance of matched features. For any input feature patch, a transformed patch that lies far from the input feature patch in descriptor space is defined as a weak match feature. A weak match finder network is proposed to actively find these weak match features; they are subsequently used in the standard descriptor learning framework. The proposed modules are integrated into an inference pipeline to form the proposed feature matching algorithm. The algorithm is evaluated on standard benchmarks and is used to solve for the parameters of image orientation of aerial oblique images. It is shown that deep learning feature based image matching leads to more registered images, more reconstructed 3D points and a more stable block geometry than conventional methods. The code is available at https://github.com/Childhoo/Chen_Matcher.git.

OD-Net: Orthogonal descriptor network for multiview image keypoint matching

OANet: Learning Two-View Correspondences and Geometry Using Order-Aware Network.

OVPT: Optimal Viewset Pooling Transformer for 3D Object Recognition.

Explore Better Network Framework for High-Resolution Optical and SAR Image Matching

Learning Enriched Feature Descriptor for Image Matching and Visual Measurement

Robust Local Feature Descriptor for Multisource Remote Sensing Image Registration

A Concurrent Multiscale Detector for End-to-End Image Matching

Learning Two-View Correspondences and Geometry Using Order-Aware Network

OD-MVSNet: Omni-dimensional dynamic multi-view stereo network

P2-Net - Joint Description and Detection of Local Features for Pixel and Point Matching.

OCR is All you need: Importing Multi-Modality into Image-based Defect Detection System

Orthogonal Decomposition Network for Pixel-Wise Binary Classification

Deep learning feature representation for image matching under large viewpoint and viewing direction change

A Region-Based Descriptor Network for Uniformly Sampled Keypoints

DRFD-Net: Using Dual Receptive Field Descriptors for Multitemporal Optical Remote Sensing Image Registration

Adaptive Context- and Scale-Aware Aggregation with Feature Alignment for One-Shot Object Detection.

Multiview Image Matching of Optical Satellite and UAV Based on a Joint Description Neural Network

ODFNet: Using orientation distribution functions to characterize 3D point clouds

MT-ORL: Multi-Task Occlusion Relationship Learning

Orthogonal Vector-Decomposed Disentanglement Network of Interactive Image Retrieval for Fashion Outfit Recommendation

Occ$^2$Net: Robust Image Matching Based on 3D Occupancy Estimation for Occluded Regions