Large-Scale Duplicate Detection for Web Image Search

Bin Wang, Zhiwei Li, Mingjing Li, Wei-ying Ma
DOI: https://doi.org/10.1109/ICME.2006.262509
2006-01-01
Abstract: Finding visually identical images in large image collections is important for many applications, such as intellectual property protection and search result presentation. Several algorithms have been reported in the literature, but they are not suitable for large image collections. In this paper, a novel algorithm is proposed for this setting: each image is compactly represented by a hash code, and only the hash codes are required to detect duplicate images. In addition, a very efficient search method is implemented to quickly group images with similar hash codes for fast detection. The experiments show that our algorithm can be both efficient and effective for duplicate detection in Web image search.
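The abstract does not specify how the hash code is computed or how similar codes are searched, so the following is only a minimal sketch of the general idea (hash each image, then group images by hash similarity), not the authors' algorithm. The mean-threshold hash, the Hamming-distance criterion, and the names `average_hash`, `hamming`, `group_duplicates`, and `max_bits` are all assumptions introduced for illustration.

```python
from typing import Dict, List, Sequence, Tuple


def average_hash(pixels: Sequence[Sequence[int]]) -> int:
    """Illustrative stand-in hash (an assumption, not the paper's hash):
    one bit per pixel, set when the pixel is brighter than the image mean."""
    flat = [p for row in pixels for p in row]
    mean = sum(flat) / len(flat)
    code = 0
    for p in flat:
        code = (code << 1) | (1 if p > mean else 0)
    return code


def hamming(a: int, b: int) -> int:
    """Number of bit positions in which two hash codes differ."""
    return bin(a ^ b).count("1")


def group_duplicates(codes: Dict[str, int], max_bits: int = 0) -> List[List[str]]:
    """Greedily group image names whose codes lie within max_bits of a
    group's first member. Only the hash codes are consulted, never pixels.
    This linear scan is for illustration; it is not the efficient search
    method the abstract refers to."""
    groups: List[Tuple[int, List[str]]] = []
    for name, code in codes.items():
        for representative, members in groups:
            if hamming(representative, code) <= max_bits:
                members.append(name)
                break
        else:
            groups.append((code, [name]))
    return [members for _, members in groups]


# Tiny 2x2 grayscale "images": b is a slightly perturbed copy of a.
images = {
    "a": [[10, 200], [10, 200]],
    "b": [[12, 198], [11, 205]],
    "c": [[200, 10], [200, 10]],
}
codes = {name: average_hash(px) for name, px in images.items()}
groups = group_duplicates(codes)
```

In this toy input, `a` and `b` produce the same code and fall into one group, while `c` forms its own group. A real system would hash a downscaled version of each image and replace the greedy scan with an index over the codes.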