Object Detection in Real Images

Dilip K. Prasad
DOI: https://doi.org/10.48550/arXiv.1302.5189
2013-02-21
Abstract:Object detection and recognition are important problems in computer vision. Since these problems are meta-heuristic, despite a lot of research, practically usable, intelligent, real-time, and dynamic object detection/recognition methods are still unavailable. We propose a new object detection/recognition method, which improves over the existing methods in every stage of the object detection/recognition process. In addition to the usual features, we propose to use geometric shapes, like linear cues, ellipses and quadrangles, as additional features. The full potential of geometric cues is exploited by using them to extract other features in a robust, computationally efficient, and less meta-heuristic manner. We also propose a new hierarchical codebook, which provides good generalization and discriminative properties. The codebook enables fast multi-path inference mechanisms based on propagation of conditional likelihoods, that make it robust to occlusion and noise. It has the capability of dynamic learning. We also propose a new learning method that has generative and discriminative learning capabilities, does not need large and fully supervised training dataset, and is capable of online learning. The preliminary work of detecting geometric shapes in real images has been completed. This preliminary work is the focus of this report. Future path for realizing the proposed object detection/recognition method is also discussed in brief.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is object detection and recognition in real - life images. Although a great deal of research has been carried out in this field, in practical applications, methods that can achieve intelligent, real - time and dynamic object detection and recognition are still lacking. The paper proposes a new object detection and recognition method, aiming to improve the performance of existing methods at each stage of the object detection and recognition process. Specifically, in addition to conventional features, this method also introduces geometric shapes (such as linear cues, ellipses and quadrilaterals) as additional features, and makes full use of these geometric cues to extract other features in a more robust, computationally efficient and less heuristic way. In addition, the paper also proposes a new hierarchical codebook, which provides good generalization and discrimination abilities, supports a fast multi - path reasoning mechanism, is robust to occlusion and noise, and has the ability of dynamic learning. At the same time, a new learning method is also proposed, which has generative and discriminative learning abilities, does not require large - scale and fully - supervised training data sets, and can perform online learning. The preliminary work focuses on the detection of geometric shapes in real - life images, which is the focus of the report. The paper also briefly discusses the path to realize the proposed object detection and recognition method in the future.