Aesthetic image captioning on the FAE-Captions dataset

Xin Jin,Jianwen Lv,Xinghui Zhou,Chaoen Xiao,Xiaodong Li,Shu Zhao
DOI: https://doi.org/10.1016/j.compeleceng.2022.107866
2022-07-01
Abstract:At present, most of the research on image aesthetics focuses on scoring pictures. We propose Aesthetic Assessment of Images, which means the dense aesthetic captioning. The image captioning model uses many photos for www.flickr.com as a training set, including lots of captions about every photo meantime. Then we use LDA for aesthetic topics and active learning to screen sentence. The remaining sentence dataset, we called FAE-Captions, is a professional dataset on the topic of image aesthetics. Put the dataset into a CNN-LSTM model for predicting and generating answers. In order to let neural network connected with aesthetics, we propose an aesthetic loss function for many aesthetics topics additionally.
engineering, electrical & electronic,computer science, interdisciplinary applications, hardware & architecture
What problem does this paper attempt to address?