Object proposal generation applying the distance dependent Chinese restaurant process

Mikko Lauri,Simone Frintrop
DOI: https://doi.org/10.48550/arXiv.1704.03706
2017-04-12
Abstract:In application domains such as robotics, it is useful to represent the uncertainty related to the robot's belief about the state of its environment. Algorithms that only yield a single "best guess" as a result are not sufficient. In this paper, we propose object proposal generation based on non-parametric Bayesian inference that allows quantification of the likelihood of the proposals. We apply Markov chain Monte Carlo to draw samples of image segmentations via the distance dependent Chinese restaurant process. Our method achieves state-of-the-art performance on an indoor object discovery data set, while additionally providing a likelihood term for each proposal. We show that the likelihood term can effectively be used to rank proposals according to their quality.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to generate object proposals more effectively in the field of robot applications and, at the same time, quantify the uncertainty of these proposals. Specifically, the author proposes a method based on the distance - dependent Chinese restaurant process (ddCRP) to generate object proposals and uses non - parametric Bayesian inference to quantify the likelihood of these proposals. This method not only improves the diversity of proposals but also can calculate the likelihood value of each proposal, so that they can be ranked according to the quality of the proposals. This is especially important in robot grasping tasks because robots need to make decisions based on input image data, for example, selecting the proposal that is most likely to correspond to the actual object as the target for grasping. ### Main Contributions 1. **Method Innovation**: Proposed an object proposal generation method based on the distance - dependent Chinese restaurant process (ddCRP), generating proposals from the posterior distribution through Markov chain Monte Carlo (MCMC) sampling. 2. **Quantifying Uncertainty**: Not only generated object proposals but also calculated the likelihood value of each proposal, which helps to evaluate the quality of the proposals. 3. **Performance Improvement**: Achieved state - of - the - art performance on the indoor object discovery dataset and provided the likelihood value of each proposal, further improving the quality of the proposals. 4. **Application Prospects**: The proposed likelihood value can be used for robots to plan their actions and improve the task success rate. ### Technical Details - **Distance - Dependent Chinese Restaurant Process (ddCRP)**: Generate image segmentation samples by defining the link probability between superpixels. - **Markov Chain Monte Carlo (MCMC) Sampling**: Draw samples from the ddCRP posterior distribution to generate multiple segmentation hypotheses. - **Proposal Extraction and Ranking**: Extract unique object proposals from multiple segmentation samples and rank the proposals according to the likelihood value and Gestalt principles. ### Experimental Results - **Dataset**: Used the Kitchen Object Discovery (KOD) dataset, which contains multiple challenging cluttered scenes. - **Performance Evaluation**: Evaluated the performance of the method through precision and recall, and the results showed that this method outperformed existing methods on multiple sequences, especially when using the likelihood - value - weighted scoring method. ### Conclusion The method proposed in the paper not only reaches the state - of - the - art level in performance but also provides an effective way to quantify the uncertainty of object proposals, which is of great significance for decision - support in robot applications. Future work could consider extending this method to RGB - D data and analyzing the parameter dependencies.