Propagating Image-Level Part Statistics to Enhance Object Detection

Sheng Gao,Joo-Hwee Lim,Qibin Sun
DOI: https://doi.org/10.1109/icip.2007.4379551
2007-01-01
Abstract:The bag-of-words approach has become increasingly attractive in the fields of object category recognition and scene classification, witnessed by some successful applications [5, 7, 11]. Its basic idea is to quantize an image using visual terms and exploit the image-level statistics for classification. However, the previous work still lacks the capability of modeling the spatial dependency and the correspondence between patches and object parts. Moreover, quantization always deteriorates the descriptive power of the patch feature. This paper proposes the hidden maximum entropy (HME) approach for modeling the object category. Each object is modeled by the parts, each having a Gaussian distribution. The spatial dependency and image-level statistics of parts are modeled through the maximum entropy approach. The model is learned by an EM-IIS (Expectation maximum embedded with improved iterative scaling) algorithm. Our experiments on the Caltech 101 dataset show that the relative reduction of equal error rate of 23.5% and relative improvement of AUC (area under ROC) of 22.0% are obtained when comparing the HME based system with the ME based baseline system.
What problem does this paper attempt to address?