Extracting Useful Knowledge from Noisy Web Images Via Data Purification for Fine-Grained Recognition.

Chuanyi Zhang,Yazhou Yao,Xing Xu,Jie Shao,Jingkuan Song,Zechao Li,Zhenmin Tang
DOI: https://doi.org/10.1145/3474085.3475536
2021-01-01
Abstract:Fine-grained visual recognition tasks typically require training data with reliable acquisition and annotation processes. Acquiring such datasets with precise fine-grained annotations is very expensive and time-consuming. Conversely, a vast amount of web data is relatively easy to obtain with nearly no human effort. Nevertheless, the presence of label noise in web images becomes a huge obstacle for training robust fine-grained recognition models. In this work, we investigate the noisy label problem and propose a method that can specifically distinguish in- and out-of-distribution noisy samples. It can purify the web training data by discarding out-of-distribution noisy images and relabeling in-distribution ones. After purification, we can train the model on a less noisy web training set to achieve better robustness and performance. Extensive experiments on three real-world web datasets for fine-grained visual recognition demonstrate the superiority of our approach.
What problem does this paper attempt to address?