A Bayesian Model for Scene Classification with a Visual Grammar

Xiaoru Wang,Jie Liu
DOI: https://doi.org/10.1109/icbnmt.2011.6155947
2011-01-01
Abstract:As the size of image databases for various applications keeps growing at an explosive rate, how to automatically and efficiently classify the images and mimic human visual perception is an extremely important issue. The spatial layout information of the image is a kind of abstract semantics and the visual grammar of the image. It could dramatically improve the effectiveness of the scene classification. We proposed a Bayesian framework based on visual grammar that aims to reduce the gap between low-level features and high-level semantics. This algorithm uses a semantic-based image segmentation algorithm to get the major regions and uses Bayesian framework to translate the region into object word and also build the object database. It builds a visual grammar by leveraging the spatial layout of the scene image. A visual vocabulary with grammar constraint for the scene classification is formed via a Bayesian learning model. The experiments show that our algorithm is simple and effective. The result of the classification meets the human visual perception well.
What problem does this paper attempt to address?