Image Emotion Caption Based on Visual Attention Mechanisms

Bo Li, Yuqing Zhou, Hui Ren
DOI: https://doi.org/10.1109/iccc51575.2020.9344900
2020-01-01
Abstract: Most automatic image captioning systems today produce objective descriptions that contain no subjective emotion. However, people who look at an image typically have some subjective feeling about it. This paper proposes an attention-based image emotion description system that automatically generates captions carrying sentiment. In the encoding stage, a CNN extracts image features. In the decoding stage, two parallel LSTMs run in the same direction: one produces factual descriptions, and the other specializes in emotional descriptions. As the attention mechanism attends to different regions of the image, a switching mechanism selects whether an emotional word should be output. The system makes image descriptions more vivid while maintaining the accuracy of the description.
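The decoding step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formulation, so everything here is an assumption made for illustration: dot-product soft attention over region features, a sigmoid gate computed from the attended context, and a soft mixture of the two word distributions (the paper's switch may instead be a hard selection). The function names `attend`, `switch_gate`, and `mix_word_distributions` are hypothetical, and the logits of the two LSTMs are taken as given rather than computed.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(region_features, hidden):
    """Soft attention (assumed dot-product scoring).

    Returns the attention weights over regions and the weighted-sum context.
    """
    scores = [sum(f * h for f, h in zip(region, hidden))
              for region in region_features]
    weights = softmax(scores)
    dim = len(region_features[0])
    context = [sum(w * region[i] for w, region in zip(weights, region_features))
               for i in range(dim)]
    return weights, context


def switch_gate(context, gate_weights, gate_bias=0.0):
    """Hypothetical switch: a sigmoid of a linear function of the context.

    Values near 1 favour the emotional LSTM, values near 0 the factual one.
    """
    z = sum(c * w for c, w in zip(context, gate_weights)) + gate_bias
    return 1.0 / (1.0 + math.exp(-z))


def mix_word_distributions(factual_logits, emotional_logits, gate):
    """Blend the two LSTMs' word distributions using the gate (an assumption)."""
    p_fact = softmax(factual_logits)
    p_emo = softmax(emotional_logits)
    return [(1.0 - gate) * pf + gate * pe for pf, pe in zip(p_fact, p_emo)]


# Toy example: three image regions with 2-dimensional features.
regions = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
hidden = [0.2, 0.9]  # stand-in for the decoder's hidden state
weights, context = attend(regions, hidden)
gate = switch_gate(context, gate_weights=[-1.0, 2.0])
probs = mix_word_distributions([2.0, 0.1, 0.1], [0.1, 0.1, 2.0], gate)
```

In a full system the CNN would supply `regions`, and each LSTM would produce its own logits at every time step; thresholding `gate` (for example at 0.5) would turn the soft mixture into a hard switch.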