Tracking cross-lingual news stories with efficient near duplicate detection

Jun Wen, Lingda Wu, Pu Zeng, XiDao Luan
2008-01-01
Journal of Information and Computational Science
Abstract: Tracking cross-lingual news stories is very useful for organizing large-scale, dynamic news video corpora. However, measuring the similarity between every pair of news stories has quadratic complexity, which makes it intractable for large volumes of news video. In this paper, we propose a fast and effective method to overcome this problem. First, we select small partitions from the corpus according to time, channel, story structure, and visual scene. Then, within each partition, we identify near-duplicate keyframes by pruning local keypoints, applying a SIFT-based matching approach, and exploiting other properties, which makes detection both fast and efficient. Finally, we link similar stories from different partitions via transitivity to track news stories. Experiments show that our approach greatly speeds up matching and improves matching accuracy.
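The partition-then-link pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that partitions may overlap (for example, adjacent time windows), so that a story shared by two partitions bridges them, and it uses a union-find structure for the transitive linking. The names `track_stories`, `partition_keys`, and `is_near_duplicate` are hypothetical, and the set-overlap test merely stands in for the SIFT-based keyframe matching.

```python
from collections import defaultdict
from itertools import combinations


class UnionFind:
    """Disjoint-set structure used to merge stories transitively."""

    def __init__(self):
        self.parent = {}

    def find(self, x):
        self.parent.setdefault(x, x)
        while self.parent[x] != x:
            # Path halving keeps the trees shallow.
            self.parent[x] = self.parent[self.parent[x]]
            x = self.parent[x]
        return x

    def union(self, a, b):
        root_a, root_b = self.find(a), self.find(b)
        if root_a != root_b:
            self.parent[root_b] = root_a


def track_stories(stories, partition_keys, is_near_duplicate):
    """Group stories into tracked threads.

    partition_keys(story) returns the keys of every partition the story
    belongs to (hypothetical helper; overlapping partitions are an assumption).
    is_near_duplicate(a, b) is a placeholder for keyframe matching.
    """
    partitions = defaultdict(list)
    for story in stories:
        for key in partition_keys(story):
            partitions[key].append(story)

    links = UnionFind()
    for story in stories:
        links.find(story["id"])  # register every story, even unmatched ones

    # Pairwise comparison happens only inside each small partition.
    for members in partitions.values():
        for a, b in combinations(members, 2):
            already_linked = links.find(a["id"]) == links.find(b["id"])
            if not already_linked and is_near_duplicate(a, b):
                links.union(a["id"], b["id"])

    threads = defaultdict(set)
    for story in stories:
        threads[links.find(story["id"])].add(story["id"])
    return sorted(sorted(thread) for thread in threads.values())


# Toy data: "frames" holds made-up keyframe signatures.
stories = [
    {"id": "a", "day": 1, "frames": {"k1"}},
    {"id": "b", "day": 2, "frames": {"k1", "k2"}},
    {"id": "c", "day": 3, "frames": {"k2"}},
    {"id": "d", "day": 3, "frames": {"k9"}},
]
threads = track_stories(
    stories,
    partition_keys=lambda s: [s["day"], s["day"] + 1],  # two-day windows
    is_near_duplicate=lambda a, b: bool(a["frames"] & b["frames"]),
)
print(threads)
```

In this toy run, stories "a" and "c" never share a partition and are never compared directly, yet they end up in the same thread because each matches "b". Comparisons stay confined to small partitions, which is how the sketch avoids the all-pairs cost over the whole corpus.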