Abstract: Almost all conventional template-matching methods use low-level image features, such as pixel intensity and pixel gradient, to measure the similarity between a template image and a scene image. Although these methods have been widely used in many applications, they cannot simultaneously address all types of robustness challenges. In this study, with the goal of addressing these challenges simultaneously, we present a robust semantic template-matching approach (RSTM). Inspired by the local binary descriptor, we propose a novel superpixel region binary descriptor (SRBD) to construct a multilevel semantic fusion feature vector for RSTM. SRBD uses a new kernel-distance-based simple linear iterative clustering (KD-SLIC) method to extract stable superpixels from the template image. Then, based on the average intensity difference between each superpixel region and its neighbors, the dominant gradient orientation of each superpixel is obtained, and the semantic features of each superpixel are described as a dominant orientation difference vector, which is coded as the rotation-invariant SRBD. In the offline matching phase, the fusion semantic feature vector of RSTM combines the multilevel SRBD features computed with different numbers of superpixels. In the online matching phase, to achieve rotation invariance, a marginal probability model is proposed and applied to locate the positions of template images in the scene image. Moreover, to accelerate computation, an image pyramid is employed. We conduct a series of experiments on a large dataset randomly selected from the MS COCO dataset to fully analyze the robustness of this approach. The experimental results show that RSTM simultaneously addresses rotation changes, scale changes, noise, occlusions, blur, nonlinear illumination changes and deformation with high time efficiency while also outperforming previous state-of-the-art template-matching methods.
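The idea of coding orientation differences into a rotation-invariant binary descriptor can be illustrated with a minimal Python sketch. It is not the SRBD algorithm itself: the abstract does not give the formulas, so the intensity-weighted direction sum, the thresholding rule, the function names (`dominant_orientations`, `orientation_difference_code`) and the toy region data are all assumptions made for illustration, and the superpixel regions are taken as already extracted (KD-SLIC is not implemented here).

```python
import numpy as np

def dominant_orientations(centroids, mean_intensity, neighbors):
    """Assumed rule: each region's orientation is the angle of the sum of unit
    vectors towards its neighbors, weighted by the mean-intensity difference."""
    theta = np.zeros(len(centroids))
    for i, nbrs in neighbors.items():
        g = np.zeros(2)
        for j in nbrs:
            d = centroids[j] - centroids[i]
            g += (mean_intensity[j] - mean_intensity[i]) * d / np.linalg.norm(d)
        theta[i] = np.arctan2(g[1], g[0])
    return theta

def orientation_difference_code(theta, neighbors, threshold=np.pi / 2):
    """Assumed coding: one bit per (region, neighbor) pair, set when the wrapped
    orientation difference is smaller in magnitude than `threshold`."""
    bits = []
    for i in sorted(neighbors):
        for j in sorted(neighbors[i]):
            # Wrap the difference into (-pi, pi] so a global rotation cancels out.
            diff = np.angle(np.exp(1j * (theta[i] - theta[j])))
            bits.append(int(abs(diff) < threshold))
    return np.array(bits, dtype=np.uint8)

# Toy data (hypothetical): four regions with centroids, mean intensities, adjacency.
centroids = np.array([[0.0, 0.0], [3.0, 1.0], [1.0, 4.0], [4.0, 5.0]])
mean_intensity = np.array([10.0, 50.0, 30.0, 90.0])
neighbors = {0: [1, 2], 1: [0, 3], 2: [0, 3], 3: [1, 2]}

theta = dominant_orientations(centroids, mean_intensity, neighbors)
code = orientation_difference_code(theta, neighbors, threshold=0.1)

# Rotate the whole layout by 40 degrees and recompute the code.
phi = np.deg2rad(40.0)
R = np.array([[np.cos(phi), -np.sin(phi)], [np.sin(phi), np.cos(phi)]])
theta_rot = dominant_orientations(centroids @ R.T, mean_intensity, neighbors)
code_rot = orientation_difference_code(theta_rot, neighbors, threshold=0.1)
print(code, code_rot)
```

Because every orientation shifts by the same angle when the layout is rotated, the pairwise differences, and therefore the bits, stay the same. This is the property the abstract relies on when it calls the descriptor rotation-invariant.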

Realtime and Robust Object Matching with a Large Number of Templates

Realtime object matching with robust dominant orientation templates

Real-time object retrieval with dominant orientation template matching improved by pyramid scoring

APPTracker Plus : Displacement Uncertainty for Occlusion Handling in Low-Frame-Rate Multiple Object Tracking

A fast template matching algorithm based on principal orientation difference

Robust Semantic Template Matching Using a Superpixel Region Binary Descriptor

Learning shared template representation with augmented feature for multi-object pose estimation

Robust object recognition via weakly supervised metric and template learning.

Robust and Accurate Object Tracking under Various Types of Occlusions

Real-time Multi-Object Tracking Based on Bi-directional Matching

Rotation-invariant template matching based on ring projection and orientation codes

Fast and Robust Matching for Multimodal Remote Sensing Image Registration

CDTracker: Coarse-to-Fine Feature Matching and Point Densification for 3D Single-Object Tracking

Robust Object Modeling for Visual Tracking

Visual Object Tracking With Mutual Affinity Aligned to Human Intuition

Occlusion-Aware Visual Object Tracking Based on Multi-template Updating Siamese Network

Fast template matching based on deformable best-buddies similarity measure

Focus On Details: Online Multi-Object Tracking with Diverse Fine-Grained Representation

Real-Time Object Tracking with Motion Information

AtptTrack: Asymmetric Transformer Tracker With Prior Templates

A fast template matching method based on improved ring projection transformation and local dynamic time warping