Abstract:The use of social media networks and mobile devices has experienced tremendous growth in recent years. This has led to a surge in the number of videos recorded and uploaded to social media platforms like TikTok and YouTube. However, this increase has also resulted in the rise of illegal duplicate videos, which are essentially the same as the original videos but with minor editing effects and variations in coding. In addition, the large number of duplicate videos is a major storage and communication efficiency issue. The task of finding duplicate videos from a large repository is referred to as video deduplication. Video deduplication is a crucial task for applications like saving storage space and detecting copyright infringement. This work proposes a fast and robust location-aware video deduplication system capable of retrieving duplicate videos from a large repository extremely quickly. In addition, the proposed system has the ability to find the precise location of the query video in the retrieved videos. To identify and localize short video clips against large video repositories, we utilize robust image-level features from keypoint aggregation and deep learning along with an efficient KNN search of query frames with a multiple k-d tree setup, giving us a set of candidate video clips. Then, a fast temporal consistence pruning algorithm re-ranks the clip-level candidates and identifies the matching clip along with its temporal location in a sequence in an efficient way. The system was tested on 1 million frame/145 hour and 4.5 million frame/636 hour repositories generated via the large-scale FIVR-200K and VCSL datasets, respectively. The proposed system achieves a recall of 98.8% and 94.1% for the FIVR-200K and VCSL datasets, respectively. A query frame is searched as fast as 83.96ms and 462.59ms from a 1 million frame/145 hour and a 4.5 million frame/636 hour repository, respectively. These experimental results demonstrate that our system is highly accurate and that the time consumption is extremely low for retrieving video along with its timestamp information from large-scale repositories.

Accelerating Near-Duplicate Video Matching by Combining Visual Similarity and Alignment Distortion.

Near-duplicate Keyframe Retrieval by Nonrigid Image Matching.

A Global Approach for Video Matching

Near-duplicate Keyframe Retrieval by Semi-Supervised Learning and Nonrigid Image Matching

A Fast Algorithm for Near-duplicate Video Detection

Near Duplicate Keyframes Identifying and Correlation Analyzing of News Video Stories

Motion-Based Temporal Alignment of Independently Moving Cameras

Online Near-Duplicate Video Clip Detection and Retrieval: an Accurate and Fast System

Fast And Robust Detection Of Near-Duplicates In Web Video Database

A Fast Near-duplicate Keyframe Detection Method Based on Local Features

Fast Tracking of Near-Duplicate Keyframes in Broadcast Domain with Transitivity Propagation

A Detection Method for Near Duplicate Video Clips Based on Content Similarity

Statistical Summarization of Content Features for Fast Near-Duplicate Video Detection.

Tracking cross-lingual news stories with efficient near duplicate detection

An efficient near-duplicate video shot detection method using shot-based interest points

A New Similarity Measure for Near Duplicate Video Clip Detection

Multiscale Video Sequence Matching for Near-Duplicate Detection and Retrieval

Efficient and Continuous Near-duplicate Video Detection

Near-Duplicate Video Retrieval and Localization Using Relative Levenshtein Distance Similarity

Subframe Video Synchronization by Matching Trajectories

Fast Video Deduplication and Localization with Temporal Consistence Re-Ranking