Weighted Hough Voting For Multi-View Car Detection

Tao Xiang,Zuomei Lai,Wensheng Qiao,Tao Li
DOI: https://doi.org/10.23919/ICIF.2017.8009658
2017-01-01
Abstract:Hough voting based methods for object detection work by means of allowing local image patches to vote for the center of the object according to the trained visual words. They are effective for object with small local varieties, but incapable of solving multi-view detection problem. The traditional way is training visual words for each subcategory that has similar view. However, limited training data prevents this from being effective. In this paper, we propose an extension to the Hough voting which allows for sharing visual words among multiple subcategories and accumulating votes with discriminative combination weights for different subcategories. The shared visual words are learned using dense image patches. Having such visual words, we can collect descriptors of samples in all subcategories and negative set to train the discriminative combination weights. The final score of a hypothesis is the maximum one in all discretized views. By fusing the geometry structure, image appearance and view information of the object, multi-view object detection problem is solved effectively. In this paper, we mainly focus on multi-view car detection, but not limited to. The proposed method is evaluated on 2 well-known datasets: MIT StreetScene Cars dataset and PASCAL VOC2007 car dataset. The experimental results demonstrate that our method achieves state-of-the-art or competitive performance.
What problem does this paper attempt to address?