Discriminative Region Enhancing and Suppression Network for Fine-Grained Visual Categorization

Guanhua Wu,Cheng Pang,Rushi Lan,Yilin Zhang,Pingping Zhou
DOI: https://doi.org/10.1007/978-3-031-47665-5_8
2023-01-01
Abstract:Attention mechanisms are intensively devoted to local feature abstraction for fine-grained visual categorization. A limitation of attention-based methods is that they focus on salient region mining and feature extraction, while ignoring the ability to incorporate discriminative and complementary features from other parts of the image. In order to address this issue, we introduce a novel network known as the Discriminative Region Enhancing and Suppression Network (DRESNet). This network efficiently extracts a wide range of diverse and complementary features, thereby enhancing the final representation. Specifically, a plug-and-play salient region diffusion (SRD) module is proposed to explicitly enhances the salient features extracted by any backbone network. The SRD module can adaptively adjust the weights of regions and redirect attention to other non-discriminative regions to generate different complementary features. The proposed discriminative region enhancing and suppression network is free from bounding boxes or part annotations and can be trained end-to-end. Our proposed method demonstrates competitive performance on three fine-grained classification benchmark datasets, as supported by extensive experimental results. Additionally, it is compatible with widely used frameworks currently in use.
What problem does this paper attempt to address?