Learning Hybrid Image Templates (HIT) by Information Projection.

Zhangzhang Si,Song-Chun Zhu
DOI: https://doi.org/10.1109/TPAMI.2011.227
2012-01-01
Abstract:This paper presents a novel framework for learning a generative image representation - the hybrid image template (HIT) from a small number of image examples. Each learned template is composed of, typically, 50~500 image patches whose geometric attributes may adapt in a local neighborhood for deformation, and whose appearances are characterized respectively by four types of descriptors: local sketch, texture gradients with orientations, flatness regions, and colors. These heterogeneous patches are automatically ranked and selected from a large pool according to their information gains using an information projection framework. Intuitively, a patch has a higher information gain if (i) its feature statistics is consistent within the training examples and is distinctive from the statistics of negative examples; and (ii) its feature statistics has less intra-class variations. The learning process pursues the most informative patches and stops when the information gain is within statistical fluctuation. This automated feature selection procedure allows our algorithm to scale up to a wide range of image categories, from those with regular shapes to those with stochastic texture. The learned representation captures the intrinsic characteristics of the object or scene categories. We evaluate the HITs on public benchmarks, and demonstrate classification performances on par with state-of-art methods like HoG+SVM.
What problem does this paper attempt to address?