LDA Model Combined Spatial Information for Visual Object Recognition Research

LI Yang, LIU Yang, GUO Maozu
DOI: https://doi.org/10.3969/j.issn.2095-2163.2013.04.008
2013-01-01
Abstract: In recent years, many researchers have introduced the LDA model, which is widely used in natural language processing, into visual object recognition, object segmentation, scene classification, and related tasks. LDA is a generative model and therefore shares a defect common to such models: it assumes that the latent topic assignments of different visual words are conditionally independent. In images, however, spatial information plays an important role in object recognition; that is, the latent topic generated for a visual word is influenced by the latent topics of its adjacent visual words. To improve the accuracy of the topic distribution given the visual words, this paper proposes an LDA model combined with spatial information, namely LDA combined with a conditional random field (CRF). The model fuses the 2D spatial information of the image into the local latent topic labels, which avoids losing spatial information and improves the accuracy of the latent topic distribution. The main research contents of the paper are as follows. First, the LDA model is improved by incorporating a conditional random field, and the model parameters are derived with the corresponding EM algorithm. The paper also uses conditional random fields to capture the 2D spatial information of images, thereby combining a generative model with a discriminative model. The resulting model strengthens the spatial correlation among the latent topic labels of adjacent visual words, a correlation determined by the natural characteristics of images, and at the same time improves the recognition rate of visual objects.
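The central idea, that a visual word's latent topic should depend on the topics of its spatial neighbours rather than be assigned independently, can be illustrated with a small sketch. This is not the paper's model or its EM derivation. It assumes a rectangular grid of visual words, hypothetical per-word topic scores `unary` (such as log-scores an LDA-style model might produce), a Potts-style agreement bonus `beta` for 4-connected neighbours, and iterated conditional modes (ICM) as a simple stand-in for CRF inference. The function name `smooth_topics` and all parameters are illustrative.

```python
def argmax(scores):
    """Index of the largest score (lowest index wins ties)."""
    return max(range(len(scores)), key=scores.__getitem__)


def smooth_topics(unary, beta=1.0, n_iters=10):
    """Assign one topic per grid cell using ICM with a Potts-style coupling.

    unary: H x W x K nested lists of per-word topic scores (assumed input).
    beta:  bonus added to a topic for each 4-neighbour currently holding it.
    Returns an H x W nested list of integer topic labels.
    """
    height, width = len(unary), len(unary[0])
    # Start from the independent assignment: each word picks its own best topic.
    labels = [[argmax(unary[i][j]) for j in range(width)] for i in range(height)]
    for _ in range(n_iters):
        changed = False
        for i in range(height):
            for j in range(width):
                score = list(unary[i][j])
                # Reward agreement with the current labels of adjacent words.
                for di, dj in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                    ni, nj = i + di, j + dj
                    if 0 <= ni < height and 0 <= nj < width:
                        score[labels[ni][nj]] += beta
                best = argmax(score)
                if best != labels[i][j]:
                    labels[i][j] = best
                    changed = True
        if not changed:
            break  # Labels are stable; further sweeps would change nothing.
    return labels
```

On a 3 x 3 grid where every word clearly prefers topic 0 except the centre, which weakly prefers topic 1, setting `beta=0.0` reproduces the independent assignment and the centre keeps topic 1, whereas `beta=1.0` pulls the centre to the topic of its neighbours. The paper's approach differs in that it learns the coupling jointly with the LDA parameters rather than fixing it by hand.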