Abstract:How to precisely and efficiently detect near-duplicate copies with complicated audiovisual transformations from a large-scale video database is a challenging task. To cope with this challenge, this article proposes a transformation-aware soft cascading (TASC) approach for multimodal video copy detection. Basically, our approach divides query videos into some categories and then for each category designs a transformation-aware chain to organize several detectors in a cascade structure. In each chain, efficient but simple detectors are placed in the forepart, whereas effective but complex detectors are located in the rear. To judge whether two videos are near-duplicates, a Detection-on-Copy-Units mechanism is introduced in the TASC, which makes the decision of copy detection depending on the similarity between their most similar fractions, called copy units (CUs), rather than the video-level similarity. Following this, we propose a CU search algorithm to find a pair of CUs from two videos and a CU-based localization algorithm to find the precise locations of their copy segments that are with the asserted CUs as the center. Moreover, to address the problem that the copies and noncopies are possibly linearly inseparable in the feature space, the TASC also introduces a flexible strategy, called soft decision boundary, to replace the single threshold strategy for each detector. Its basic idea is to automatically learn two thresholds for each detector to examine the easy-to-judge copies and noncopies, respectively, and meanwhile to train a nonlinear classifier to further check those hard-to-judge ones. Extensive experiments on three benchmark datasets showed that the TASC can achieve excellent copy detection accuracy and localization precision with a very high processing efficiency.

Video Copy-Detection and Localization with a Scalable Cascading Framework

Video Copy Detection Using a Soft Cascade of Multimodal Features

TASC: A Transformation-Aware Soft Cascading Approach for Multimodal Video Copy Detection

A Multimodal Video Copy Detection Approach with Sequential Pyramid Matching

Video Copy Detection Based on Multiple Visual Feature Matching

Content-based copy detection through multimodal feature representation and temporal pyramid matching

A content-based video copy detection method with randomly projected binary features.

PKU-IDM @TRECVID2011 CBCD: Content-Based Copy Detection with Cascade of Multimodal Features and Temporal Pyramid Matching.

A video copy detection algorithm combining local feature's robustness and global feature's speed

Multilevel Spatial-Temporal Feature Aggregation for Video Object Detection

Large Margin Object Tracking with Circulant Feature Maps

A Hierarchical Scheme for Rapid Video Copy Detection

Video copy detection based on multiple visual features synthesizing

Video object matching across multiple non-overlapping camera views based on multi-feature fusion and incremental learning.

Fine-search for image copy detection based on local affine-invariant descriptor and spatial dependent matching

Fast copy detection based on Slice Entropy Scattergraph

A Fast Video Copy Detection Approach By Dynamic Programming

A Segmentation and Graph-Based Video Sequence Matching Method for Video Copy Detection

GPU-Accelerated Video Copy Detection Based on Incremental Clustering

A Cascade Tracking Algorithm Based On Artificial Beacon And Hog Features

Visual words based spatiotemporal sequence matching in video copy detection