Abstract:How to precisely and efficiently detect near-duplicate copies with complicated audiovisual transformations from a large-scale video database is a challenging task. To cope with this challenge, this article proposes a transformation-aware soft cascading (TASC) approach for multimodal video copy detection. Basically, our approach divides query videos into some categories and then for each category designs a transformation-aware chain to organize several detectors in a cascade structure. In each chain, efficient but simple detectors are placed in the forepart, whereas effective but complex detectors are located in the rear. To judge whether two videos are near-duplicates, a Detection-on-Copy-Units mechanism is introduced in the TASC, which makes the decision of copy detection depending on the similarity between their most similar fractions, called copy units (CUs), rather than the video-level similarity. Following this, we propose a CU search algorithm to find a pair of CUs from two videos and a CU-based localization algorithm to find the precise locations of their copy segments that are with the asserted CUs as the center. Moreover, to address the problem that the copies and noncopies are possibly linearly inseparable in the feature space, the TASC also introduces a flexible strategy, called soft decision boundary, to replace the single threshold strategy for each detector. Its basic idea is to automatically learn two thresholds for each detector to examine the easy-to-judge copies and noncopies, respectively, and meanwhile to train a nonlinear classifier to further check those hard-to-judge ones. Extensive experiments on three benchmark datasets showed that the TASC can achieve excellent copy detection accuracy and localization precision with a very high processing efficiency.

Video Copy Detection Using a Soft Cascade of Multimodal Features

Video Copy-Detection and Localization with a Scalable Cascading Framework

TASC: A Transformation-Aware Soft Cascading Approach for Multimodal Video Copy Detection

Video Copy Detection Based on Multiple Visual Feature Matching

A Multimodal Video Copy Detection Approach with Sequential Pyramid Matching

Content-based copy detection through multimodal feature representation and temporal pyramid matching

Video copy detection based on multiple visual features synthesizing

A content-based video copy detection method with randomly projected binary features.

Multimodal fusion for video copy detection

PKU-IDM @TRECVID2011 CBCD: Content-Based Copy Detection with Cascade of Multimodal Features and Temporal Pyramid Matching.

A video copy detection algorithm combining local feature's robustness and global feature's speed

Learning To Multimodal Hash For Robust Video Copy Detection

Partial Copy Detection in Videos: A Benchmark and an Evaluation of Popular Methods.

Video Copy Detection Method: A Review

A Dual-level Detection Method for Video Copy Detection

Video Copy Detection Based on Spatiotemporal Fusion Model

Efficient Image Copy Detection Using Multiscale Fingerprints

Video object matching across multiple non-overlapping camera views based on multi-feature fusion and incremental learning.

A Hierarchical Scheme for Rapid Video Copy Detection

Fast copy detection based on Slice Entropy Scattergraph

A Rotation Invariant Descriptor for Robust Video Copy Detection