Graph-based Stereo Matching by Incorporating Monocular Cues

Xiangyin Ma, Hongbin Zha
2008-01-01
Abstract: Stereo vision is one of the most intensively studied and challenging problems in computer vision. It uses stereo cues to extract 3D information from 2D images. Besides stereo cues, there are statistical-learning-based approaches that exploit monocular cues to predict the underlying 3D structure. In human vision, the remarkable ability for 3D interpretation rests on the combination of these two kinds of cues. In this paper, we therefore attempt to incorporate monocular cues into a stereo matching system. A two-level graph is designed to fuse the low-resolution monocular cues with the high-resolution stereo cues, and the optimal labeling is then computed via graph cuts. We test our algorithm on scenes of buildings, which have relatively large disparity ranges and poor texture. The experimental results show that we obtain a more accurate disparity map than is possible using either monocular or stereo cues alone.
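To make the idea of fusing coarse monocular cues with fine stereo cues concrete, the following is a minimal sketch of a labeling energy on a toy 1D scanline. It is not the paper's formulation: the function names, the absolute-difference cost forms, the weights, and the toy costs are all illustrative assumptions, and exhaustive search stands in for graph cuts, which would be needed on real images.

```python
from itertools import product

def two_level_energy(disparity, stereo_cost, mono_prior, block=2,
                     smooth_weight=1.0, mono_weight=1.0):
    """Energy of one disparity labeling on a hypothetical two-level graph.

    disparity   : one integer label per fine (high-resolution) pixel
    stereo_cost : stereo_cost[p][d] is the matching cost of label d at pixel p
    mono_prior  : one predicted disparity per coarse node; each coarse node
                  covers `block` consecutive fine pixels
    """
    # Fine level: stereo matching cost of the chosen labels.
    data = sum(stereo_cost[p][d] for p, d in enumerate(disparity))
    # Fine level: penalize label jumps between neighboring pixels.
    smooth = sum(abs(disparity[p] - disparity[p + 1])
                 for p in range(len(disparity) - 1))
    # Cross level: penalize disagreement with the coarse monocular prediction.
    mono = sum(abs(d - mono_prior[p // block])
               for p, d in enumerate(disparity))
    return data + smooth_weight * smooth + mono_weight * mono

def brute_force_labeling(stereo_cost, mono_prior, num_labels, **weights):
    """Exhaustive minimizer for tiny problems (a stand-in for graph cuts)."""
    candidates = product(range(num_labels), repeat=len(stereo_cost))
    return min(candidates,
               key=lambda lab: two_level_energy(lab, stereo_cost,
                                                mono_prior, **weights))

# Toy scanline: poor texture makes two labels equally cheap at every pixel.
stereo_cost = [[1, 1, 4], [1, 1, 4], [4, 1, 1], [4, 1, 1]]
mono_prior = [1, 2]  # assumed coarse predictions, one per 2-pixel block

stereo_only = brute_force_labeling(stereo_cost, mono_prior, 3, mono_weight=0.0)
fused = brute_force_labeling(stereo_cost, mono_prior, 3)
```

In this toy setting the stereo term alone cannot separate the two halves of the scanline, while the coarse monocular term breaks the ambiguity. The example only illustrates how such terms can interact and says nothing about the paper's actual results.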