Bridge the gap between supervised and unsupervised learning for fine-grained classification
Jiabao Wang,Yang Li,Xiu-Shen Wei,Hang Li,Zhuang Miao,Rui Zhang
DOI: https://doi.org/10.1016/j.ins.2023.119653
IF: 8.1
2023-09-14
Information Sciences
Abstract:Unsupervised learning technology has caught up with or even surpassed supervised learning technology in general object classification (GOC) and person re-identification (re-ID). However, it has been discovered that the unsupervised learning of fine-grained visual classification (FGVC) is more difficult than GOC and person re-ID. To bridge the gap between unsupervised and supervised learning for FGVC, we investigate the essential factors (including feature extraction, clustering, and contrastive learning) for the performance gap between supervised and unsupervised FGVC. Furthermore, we propose a simple, effective, and practical method, termed as unsupervised fine-grained clustering learning (UFCL), to alleviate this gap. Three key issues are concerned and improved: First, we introduce a robust and powerful backbone, ResNet50-IBN, which has the ability of domain adaptation when we transfer ImageNet pre-trained models to FGVC tasks. Next, we propose to introduce HDBSCAN rather than DBSCAN for clustering, which can generate better clusters for adjacent categories with fewer hyper-parameters. Finally, we propose a weighted feature agent and its update mechanism to perform contrastive learning by employing pseudo labels with unavoidable noise, which can enhance the optimization process of learning the network's parameters. The effectiveness of our UFCL was confirmed on the CUB-200-2011, Oxford-Flowers, Oxford-Pets, Stanford-Dogs, Stanford-Cars, and FGVC-Aircraft datasets. Under the unsupervised FGVC setting, we achieved state-of-the-art results and examined the primary factors and crucial parameters to offer practical guidance.
computer science, information systems