Image Captioning with Scene-graph Based Semantic Concepts.

Lizhao Gao, Bo Wang, Wenmin Wang
DOI: https://doi.org/10.1145/3195106.3195114
2018-01-01
Abstract: Unlike existing approaches to image captioning, this paper explores the co-occurrence dependency of high-level semantic concepts and proposes a novel method that uses a scene-graph-based semantic representation. To embed the scene graph as an intermediate state, we divide the image captioning task into two phases: concept cognition and sentence construction. We build a vocabulary of semantic concepts and propose a CNN-RNN-SVM framework to generate a scene-graph-based sequence, which is then transformed into a bit vector that serves as the input to the RNN in the next phase. We evaluate our method on the MS COCO dataset. Experimental results show that our approach achieves results competitive with or superior to the state of the art.
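The step in which the scene-graph-based sequence is transformed into a bit vector can be illustrated with a minimal sketch. The abstract does not specify the vocabulary or the exact encoding, so everything here is an assumption made for illustration: the five-word `VOCAB`, the helper name `to_bit_vector`, and the choice of a multi-hot encoding in which each position marks whether a vocabulary concept appears in the sequence.

```python
from typing import Dict, Iterable, List

# Hypothetical concept vocabulary; the paper's actual vocabulary is not
# given in the abstract.
VOCAB: List[str] = ["man", "horse", "field", "riding", "brown"]
INDEX: Dict[str, int] = {concept: i for i, concept in enumerate(VOCAB)}


def to_bit_vector(concepts: Iterable[str]) -> List[int]:
    """Encode a concept sequence as a multi-hot vector over VOCAB.

    Assumed encoding: bit i is 1 if VOCAB[i] occurs in the sequence.
    Concepts outside the vocabulary are ignored.
    """
    bits = [0] * len(VOCAB)
    for concept in concepts:
        position = INDEX.get(concept)
        if position is not None:
            bits[position] = 1
    return bits


# Example: a predicted concept sequence, including one out-of-vocabulary item.
sequence = ["man", "riding", "horse", "zebra"]
vector = to_bit_vector(sequence)
```

A fixed-length vector of this kind could then be supplied to the sentence-construction RNN alongside or in place of raw image features. Whether the paper's encoding also preserves order or relation structure from the scene graph cannot be determined from the abstract.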