A New Framework for Image Dataset Construction with Web Images

Y Yao,J Zhang,F Shen,X Hua,J Xu,Z Tang
2015-01-01
Abstract:Labelled image datasets have played a critical role in high-level image understanding. In the early years, manual annotation was the most important way to construct image datasets.(eg, STL-10 [1], CIFAR-10 [2], PASCAL VOC 2007 [3], ImageNet [4] and Caltech-101 [5]). However, the process of manual labelling is both timeconsuming and labor intensive. With the development of the Internet, methods of exploiting web images for automatic image dataset construction have recently become a hot topic in the field of multimedia processing.Schroff et al.[6] adopted text information to rank images retrieved from a web search and used these top-ranked images to learn visual models to re-rank images once again. Li et al.[7] leveraged the first few images returned from an image search engine to train the image classifier, which uses incremental learning to refine its model. With the increase in the number of positive images accepted by the classifier, the trained classifier will reach a robust level for this query. Hua et al.[8] proposed to use clustering based method to filter “group” noisy images and propagation based method to filter individual noisy images.
What problem does this paper attempt to address?