Dense Metric with Meta-Classifier for Few-Shot Image Classification

Yong Wang, Kaitong Li, Xiaoyu He
DOI: https://doi.org/10.1109/icaci60820.2024.10537021
2024-01-01
Abstract: Metric-based methods have achieved great success in few-shot image classification (FSIC). Most existing metric-based methods use distance metrics as the basis for FSIC: they model the similarity between each pair of query and class feature maps by computing the distance between feature vectors pooled from those feature maps. In our view, these methods have two issues: 1) the pooling operation discards local similarity information, which harms classification; 2) the distance metrics usually ignore inter-class information, which makes the distance task-agnostic. To address these two issues, we propose a Meta Dense Metric Network called MDMNet. Specifically, we first propose a dense metric module, which calculates the distance between feature vectors at each spatial location for each pair of query and class feature maps and generates a dense metric feature map. Because the dense metric operation replaces the pooling operation, the resulting dense metric feature map not only contains rich local similarity information but also preserves spatial structure. In addition, we design a meta-classifier with learnable convolutions that aggregates information across classes from all the dense metric feature maps, thus generating task-specific FSIC scores. Extensive experiments verify the effectiveness of MDMNet.
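To make the contrast between pooled and dense metrics concrete, here is a minimal NumPy sketch. It rests on several assumptions that the abstract does not specify: cosine similarity stands in for the distance metric, each class is represented by a single feature map, and the meta-classifier is reduced to a 1x1 convolution that mixes the per-class maps followed by spatial averaging. The function names `pooled_metric`, `dense_metric`, and `meta_classifier_sketch` are hypothetical, and the mixing weights would be learned in practice. This illustrates the idea and is not the MDMNet architecture.

```python
import numpy as np


def pooled_metric(query, class_maps, eps=1e-8):
    """Baseline: global-average-pool each map, then one cosine score per class.

    query: (C, H, W); class_maps: (N, C, H, W). Returns shape (N,).
    """
    q = query.mean(axis=(1, 2))
    p = class_maps.mean(axis=(2, 3))
    q = q / (np.linalg.norm(q) + eps)
    p = p / (np.linalg.norm(p, axis=1, keepdims=True) + eps)
    return p @ q


def dense_metric(query, class_maps, eps=1e-8):
    """Cosine similarity at every spatial location (assumed metric).

    Returns one dense metric map per class, shape (N, H, W).
    """
    q = query / (np.linalg.norm(query, axis=0, keepdims=True) + eps)
    p = class_maps / (np.linalg.norm(class_maps, axis=1, keepdims=True) + eps)
    return np.einsum("chw,nchw->nhw", q, p)


def meta_classifier_sketch(dense_maps, mix_weights):
    """Hypothetical stand-in for the meta-classifier.

    Treats the N class maps as channels, mixes them with a 1x1 convolution
    (mix_weights has shape (N, N)), then averages over space so that each
    class score can depend on every other class's map. Returns shape (N,).
    """
    mixed = np.einsum("mn,nhw->mhw", mix_weights, dense_maps)
    return mixed.mean(axis=(1, 2))


# Toy 5-way task: the query is an exact copy of class 2's feature map.
rng = np.random.default_rng(0)
class_maps = rng.normal(size=(5, 8, 4, 4))
query = class_maps[2].copy()

pooled = pooled_metric(query, class_maps)          # one number per class
dense = dense_metric(query, class_maps)            # one 4x4 map per class
scores = meta_classifier_sketch(dense, np.eye(5))  # identity = no class mixing
predicted = int(np.argmax(scores))
```

With identity mixing weights the sketch reduces to averaging each dense map independently. Off-diagonal weights are what would let one class's score depend on the other classes' maps, which is the task-specific behaviour the abstract attributes to the meta-classifier.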