Fine-grained Image Classification by Visual-Semantic Embedding

Huapeng Xu, Guilin Qi, Jingjing Li, Meng Wang, Kang Xu, Huan Gao
DOI: https://doi.org/10.24963/ijcai.2018/145
2018-01-01
Abstract: This paper investigates a challenging problem known as fine-grained image classification (FGIC). Unlike conventional computer vision problems, FGIC suffers from large intra-class diversity and subtle inter-class differences. Existing FGIC approaches explore only the visual information embedded in the images. In this paper, we present a novel approach that uses readily available prior knowledge from either structured knowledge bases or unstructured text to facilitate FGIC. Specifically, we propose a visual-semantic embedding model that explores semantic embeddings from knowledge bases and text, and then trains a novel end-to-end CNN framework to linearly map image features to a rich semantic embedding space. Experimental results on the challenging large-scale UCSD Bird-200-2011 dataset verify that our approach outperforms several state-of-the-art methods by a significant margin.
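The core idea in the abstract, linearly mapping image features into a semantic embedding space, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the names `linear_map`, `cosine`, and `classify` are hypothetical, the weights and class embeddings are toy values, and picking the class by cosine similarity to the projected feature is one plausible decision rule rather than the authors' training or inference procedure.

```python
import math


def linear_map(features, weights):
    """Project an image feature vector into the semantic space: e = W f."""
    return [sum(w * f for w, f in zip(row, features)) for row in weights]


def cosine(u, v):
    """Cosine similarity between two vectors (0.0 if either has zero norm)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def classify(features, weights, class_embeddings):
    """Return the class whose semantic embedding is closest to the projection.

    Assumption: nearest-embedding matching by cosine similarity.
    """
    projected = linear_map(features, weights)
    return max(class_embeddings,
               key=lambda name: cosine(projected, class_embeddings[name]))


# Toy values (hypothetical): a 2x3 matrix maps 3-d image features
# into a 2-d semantic space that holds one embedding per class.
W = [[1.0, 0.0, 0.0],
     [0.0, 1.0, 1.0]]
CLASS_EMBEDDINGS = {"class_a": [1.0, 0.0], "class_b": [0.0, 1.0]}

predicted = classify([0.1, 0.9, 0.8], W, CLASS_EMBEDDINGS)
print(predicted)
```

In a real system the feature vector would come from a CNN and the class embeddings from a knowledge base or text corpus, with the linear map learned jointly with the network; the sketch fixes all of these by hand only to show the data flow.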