Fine-grained image recognition via trusted multi-granularity information fusion

Ying Yu, Hong Tang, Jin Qian, Zhiliang Zhu, Zhen Cai, Jingqin Lv
DOI: https://doi.org/10.1007/s13042-022-01685-6
2023-03-19
International Journal of Machine Learning and Cybernetics
Abstract: Fine-grained image recognition (FGIR) is more challenging than general image recognition tasks due to inherently subtle object variation. Existing FGIR methods are mainly based on single-granularity feature fusion; the extracted fused features often cannot fully reflect the characteristics of the object, and recognition results based on the fused features also lack interpretability. To address these problems, we propose a novel end-to-end trusted multi-granularity information fusion (TMGIF) model for weakly-supervised fine-grained image recognition. It automatically extracts a multi-granularity information representation for a fine-grained image, evaluates the quality of the information granules, and then progressively fuses the multi-granularity information according to that quality to obtain a reliable and interpretable recognition result. We evaluate TMGIF on three standard benchmark datasets and demonstrate that the proposed method provides competitive results.
computer science, artificial intelligence