Image Captioning With Relational Knowledge

Huan Yang, Dandan Song, Lejian Liao
DOI: https://doi.org/10.1007/978-3-319-97310-4_43
2018-01-01
Abstract: People learn extensive relational knowledge from daily life. This is one of the factors that enable humans to describe the information in images easily. In this paper, we propose a novel framework called Image Captioning with Relational Knowledge (ICRK) that combines relational knowledge with an image captioning model and uses that knowledge to strengthen the learning of word representations. As more precise syntactic and semantic word relationships are learned, the image captioning model acquires more semantic features that help it generate more accurate image descriptions. Experiments on several benchmark datasets, using automatic evaluation metrics, demonstrate that our model can significantly improve the quality of image captioning.