Multi-Frequency Feature Enhancement for Multi-Granularity Visual Classification

Meijiang Fu, Yixiao Zheng, Dongliang Chang, Wenpan Li, Zhanyu Ma
DOI: https://doi.org/10.1109/apsipaasc58517.2023.10317566
2023-01-01
Abstract: Multi-granularity visual classification is a challenging task derived from traditional image recognition. Previous methods commonly use the features from the final convolutional layer to perform multi-granularity visual classification. However, the features required to classify labels at different granularities are not the same: the finer the granularity of the label, the more detailed the features need to be. The key to multi-granularity visual classification is therefore to extract effective features for each granularity level. In natural images, the high-frequency parts usually encode detailed information, while the low-frequency parts often encode global structure. Mapping the output features of convolutional layers into high-frequency and low-frequency parts may therefore enhance feature learning across granularity levels. In this paper, we decompose the output features of convolutional layers into high-frequency and low-frequency counterparts, and use combinations of the high-frequency and low-frequency features from various convolutional layers to learn labels at different granularities. We evaluate the proposed method on three publicly available datasets: CUB-200-2011, Stanford Cars, and FGVC-Aircraft. The experimental results demonstrate the effectiveness of the proposed method.
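The abstract does not say how the features are decomposed, so the following is only an illustrative sketch of one common way to split a feature map into low-frequency and high-frequency parts: a radial low-pass mask in the Fourier domain, with the high-frequency part taken as the residual. The function name `split_frequencies` and the `cutoff` parameter are assumptions made for this example and are not taken from the paper.

```python
import numpy as np


def split_frequencies(features, cutoff=0.25):
    """Split a (C, H, W) feature map into low- and high-frequency parts.

    Hypothetical helper, not the paper's method. `cutoff` is the fraction
    of the Nyquist frequency (0.5 cycles per sample) kept in the low band.
    """
    _, h, w = features.shape

    # Per-axis frequencies in cycles per sample, in the range [-0.5, 0.5).
    fy = np.fft.fftfreq(h)[:, None]
    fx = np.fft.fftfreq(w)[None, :]
    radius = np.sqrt(fy ** 2 + fx ** 2)

    # Binary low-pass mask: keep only frequencies inside the cutoff radius.
    mask = (radius <= cutoff * 0.5).astype(float)

    # Filter every channel independently over its spatial axes.
    spectrum = np.fft.fft2(features, axes=(-2, -1))
    low = np.fft.ifft2(spectrum * mask, axes=(-2, -1)).real

    # The high-frequency part is whatever the low-pass filter removed.
    high = features - low
    return low, high
```

In a setup like the one the abstract describes, the `low` output of a layer would plausibly feed a coarse-label classifier and the `high` output a fine-label classifier, but which combination goes to which granularity level is a design choice the abstract leaves open.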