Multi-discriminative Parts Mining for Fine-Grained Visual Classification

Pingping Zhou,Cheng Pang,Rushi Lan,Guanhua Wu,Yilin Zhang
DOI: https://doi.org/10.1007/978-3-031-47665-5_23
2023-01-01
Abstract:Fine-Grained Visual Classification (FGVC) aims to differentiate visually similar but subtly different subordinate categories of the same basic category. However, current methods primarily exploit deep-layer features to locate to the strong salient part of the network. This paper finds that some subtle but discernible parts and the rich details in shallow-layer features are also valuable for classification. Consequently, this paper proposes a fine-grained visual classification framework that integrates multiple discriminative parts and multi-layer features. Our framework consists of two modules: 1) The attention map based locate-mine module locates the most discriminate part and masks it, thereby encouraging the network to mine other discriminative parts. 2) The multi-layer feature fusion module combines shallow-layer and deep-layer features to enrich local details in discriminative features. We also introduce an adaptive label loss to distinguish categories with high similarity. Experimental results show that our approach achieves excellent performance on three widely used fine-grained benchmark datasets.
What problem does this paper attempt to address?