Visual words based spatiotemporal sequence matching in video copy detection

Huamin Ren, Shouxun Lin, Dongming Zhang, Sheng Tang, Ke Gao
DOI: https://doi.org/10.1109/ICME.2009.5202761
2009-01-01
Abstract: This paper proposes a novel content-based copy retrieval scheme for video copy identification. Its goal is to detect matches between a suspect video and the videos stored in the database of the legal holders. Because a copy may have undergone various transformations, we represent each frame as a vector of visual words built from SIFT descriptors. Unlike the traditional Bag-of-Words (BoW) approach used in semantic retrieval, which neglects temporal variation within the video, our matching algorithm takes into account both the spatial and the temporal distances between a query clip and a clip in the database. Experiments show that our approach is robust and effective under various single and compound transformations.
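The abstract does not give the matching algorithm itself, so the following is only a minimal sketch of how spatial and temporal distances between visual-word sequences might be combined. It assumes each frame has already been reduced to a histogram of visual-word counts (for example by quantizing SIFT descriptors against a codebook). The function names, the `alpha` weight, and the `slack` window are hypothetical choices made for illustration and are not taken from the paper.

```python
from typing import List, Sequence, Tuple

Histogram = Sequence[float]  # visual-word counts for one frame


def normalize(hist: Histogram) -> List[float]:
    """Scale a histogram so its entries sum to 1 (all zeros if it is empty)."""
    total = sum(hist)
    return [v / total for v in hist] if total > 0 else [0.0] * len(hist)


def spatial_distance(a: Histogram, b: Histogram) -> float:
    """Half the L1 distance between normalized histograms, a value in [0, 1]."""
    na, nb = normalize(a), normalize(b)
    return 0.5 * sum(abs(x - y) for x, y in zip(na, nb))


def match_clip(
    query: List[Histogram],
    reference: List[Histogram],
    alpha: float = 0.5,  # hypothetical weight of the temporal term
    slack: int = 1,      # hypothetical tolerance, in frames, around the expected position
) -> Tuple[int, float]:
    """Slide the query over the reference and return (best_offset, best_cost).

    Each query frame may align with any reference frame within `slack` frames
    of its expected position; drifting from that position adds a temporal
    penalty to the spatial (histogram) distance.
    """
    n, m = len(query), len(reference)
    if n == 0 or m < n:
        raise ValueError("query must be non-empty and no longer than reference")

    best_offset, best_cost = -1, float("inf")
    for offset in range(m - n + 1):
        cost = 0.0
        for i, q in enumerate(query):
            expected = offset + i
            lo = max(0, expected - slack)
            hi = min(m - 1, expected + slack)
            # Cheapest alignment for this frame: spatial plus weighted temporal cost.
            cost += min(
                spatial_distance(q, reference[j])
                + alpha * abs(j - expected) / (slack + 1)
                for j in range(lo, hi + 1)
            )
        cost /= n  # average per-frame cost for this offset
        if cost < best_cost:
            best_offset, best_cost = offset, cost
    return best_offset, best_cost
```

Calling `match_clip(query, reference)` returns the offset in the reference sequence with the lowest combined cost. A practical detector would compare that cost against a threshold to decide whether the query is a copy, a step this sketch leaves out.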