Image Annotation by Sentences Based on Word Sequence Blocks Building Model

Hong-bin ZHANG, Yi YIN, Dong-hong JI, Ya-feng REN
DOI: https://doi.org/10.15918/j.tbit1001-0645.2017.11.07
2017-01-01
Abstract: Annotating images with sentences establishes cross-media correlations between images and text, which ultimately improves information retrieval accuracy and the user's retrieval experience. The KDES model was applied to extract image features, and the extracted features were then fused in a multiple kernel learning model to obtain MK-KDES features that capture the key visual characteristics of the images. A new natural language generation model, word sequence blocks building (WSBB), was designed to evaluate the semantic correlations between words and images, and several key words were selected for sentence generation. Guided by the semantic correlations and syntactic mode constraints between words, the selected key words were combined into N-gram word sequences that describe the images more accurately. Finally, sentences were generated by filling the N-gram word sequences into templates. Experimental results show that the MK-KDES-1 feature focuses on the key texture and shape characteristics of the images, which helps improve BLEU-1 scores. Moreover, both the semantic correlation between words and the syntactic mode constraints are important prerequisites for improving BLEU-2 scores.
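The BLEU-1 and BLEU-2 scores mentioned in the abstract are built on clipped n-gram precision between a generated sentence and its reference sentences. The following is a minimal sketch of that precision only, not the paper's evaluation code: the function names `ngrams` and `modified_precision` and the example sentences are illustrative assumptions, and full BLEU additionally applies a brevity penalty and combines several n-gram orders.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def modified_precision(candidate, references, n):
    """Clipped n-gram precision of a candidate against reference sentences.

    Each candidate n-gram is credited at most as many times as it occurs
    in the single reference where it is most frequent.
    """
    cand_counts = Counter(ngrams(candidate, n))
    if not cand_counts:
        return 0.0
    # Maximum count of each n-gram over all references.
    max_ref = Counter()
    for ref in references:
        for gram, count in Counter(ngrams(ref, n)).items():
            max_ref[gram] = max(max_ref[gram], count)
    clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
    return clipped / sum(cand_counts.values())


# Hypothetical generated sentence and reference annotation.
candidate = "a dog runs on the grass".split()
references = ["a dog is running on the grass".split()]

p1 = modified_precision(candidate, references, 1)  # unigram precision
p2 = modified_precision(candidate, references, 2)  # bigram precision
print(f"unigram precision: {p1:.3f}, bigram precision: {p2:.3f}")
```

In this example five of the six candidate words appear in the reference, while only three of the five candidate bigrams do. Unigram precision rewards choosing the right words, whereas bigram precision also depends on placing them in a plausible order, which is consistent with the abstract linking BLEU-2 to word-to-word correlation and syntactic constraints.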