Abstract:Partial video copy detection (PVCD) aims to discover copy segments of query videos from a video database, which plays an important role in video copyright protection, filtering, tracking, etc. For a large-scale video database, PVCD can be divided into two stages: the first stage involves searching for video-level copies of the query video in the database, and the second stage is to further localize the copy segments within the video-level copies. Thus, two major challenges arise: (1) efficiently and effectively calculating the similarity between videos; (2) localizing mixed-duration video pairs. To address the above challenges, we propose an efficient PVCD approach for a large-scale video database, based on the Bag-of-Words (BoW) framework, which decouples video-level similarity and copy localization into cell-level. This approach consists of two modules. The first is an efficient video similarity measurement (VSM) module for the large-scale video database. VSM aggregates cell-level similarity into video-level similarity, and with a dual index, it greatly improves retrieval speed while accurately measuring spatiotemporal transformations. The second is a greedy pattern detection (GPD) module for video copy localization. GPD quickly and accurately detects similarity patterns through a greedy strategy on the similarity matrix formed by matching frames in each cell, then aggregates them into complete predicted copy segments. On the comprehensive dataset self-SVD, VSM significantly outperforms state-of-the-art methods by 7.28% in mAP, and the retrieval speed is increased by over 318 times. Additionally, for short videos at the scale of hundreds of millions, the response speed can theoretically reach seconds. On the copy localization dataset MIX, composed of mixed-duration videos, GPD also achieves the best performance.

Visual words based spatiotemporal sequence matching in video copy detection

Video Copy Detection Based on Multiple Visual Feature Matching

A content-based video copy detection method with randomly projected binary features.

A Multimodal Video Copy Detection Approach with Sequential Pyramid Matching

Spatiotemporal Video Copy Detection Based on Visual Perception Analyses

Video Copy Detection Based on Spatiotemporal Fusion Model

Semantic Sequence Kin: A Method of Document Copy Detection

A Sparse Representation-Based Approach for Video Copy Detection

Video copy detection based on multiple visual features synthesizing

A Segmentation and Graph-Based Video Sequence Matching Method for Video Copy Detection

An Improved Video Identification Scheme Based on Video Tomography.

Content-based copy detection through multimodal feature representation and temporal pyramid matching

Invariant Visual Patterns for Video Copy Detection

An Efficient Partial Video Copy Detection for a Large-scale Video Database

Video Identification Using Spatio-temporal Salient Points

Video copy detection method based on contents

Fine-search for image copy detection based on local affine-invariant descriptor and spatial dependent matching

Video Copy-Detection and Localization with a Scalable Cascading Framework

Continuous Content-Based Copy Detection over Streaming Videos

A Hierarchical Scheme for Rapid Video Copy Detection

Shrinking the Semantic Gap: Spatial Pooling of Local Moment Invariants for Copy-Move Forgery Detection