IMAGE CAPTION BASED ON IMAGE SEMANTIC SIMILARITY NETWORK

Chang Liu, Xiangdong Zhou, Bole Shi
DOI: https://doi.org/10.3969/j.issn.1000-386x.2018.01.037
2018-01-01
Abstract: Image captioning addresses the problem of high-level semantic image understanding. Because of the semantic gap, generated captions often diverge from the image content, and language models based on shallow neural networks struggle to generate fluent sentences. To solve these problems, this paper proposes an image semantic similarity neural network, which adds fully connected layers after the output layer of the recurrent neural network and thereby introduces visual similarity and semantic similarity information between images. In addition, the method increases the depth of the stacked hidden layers and common hidden layers to improve the learning ability of the language model, and ultimately produces image captions close to natural human language. Experiments show that the proposed method generates high-quality captions and performs well on the BLEU, ROUGE, METEOR, and CIDEr evaluation metrics.
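For readers unfamiliar with the evaluation metrics named in the abstract, the following is a minimal sketch of the clipped (modified) n-gram precision on which BLEU is built. It is not the paper's evaluation code and it is not full BLEU: the brevity penalty and the geometric mean over n-gram orders are omitted. The function names and the example captions are illustrative assumptions.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return the list of n-grams (as tuples) in a token sequence."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def modified_precision(candidate, references, n=1):
    """Clipped n-gram precision of a candidate caption against references.

    Each candidate n-gram count is clipped to the maximum number of times
    that n-gram occurs in any single reference caption.
    """
    cand_counts = Counter(ngrams(candidate, n))
    if not cand_counts:
        return 0.0  # candidate is shorter than n tokens

    # Largest count of each n-gram over all reference captions.
    max_ref_counts = Counter()
    for ref in references:
        for gram, count in Counter(ngrams(ref, n)).items():
            max_ref_counts[gram] = max(max_ref_counts[gram], count)

    clipped = sum(min(count, max_ref_counts[gram])
                  for gram, count in cand_counts.items())
    return clipped / sum(cand_counts.values())


# Hypothetical captions, used only for illustration.
candidate = "a dog runs on the grass".split()
references = [
    "a dog is running on the grass".split(),
    "the dog runs across a field".split(),
]

print(modified_precision(candidate, references, n=1))
print(modified_precision(candidate, references, n=2))
```

Clipping is what prevents a degenerate caption that repeats one common word from scoring well: each repeated n-gram is credited at most as often as it appears in a single reference.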