Three-dimensional Object Tracking Based on Perspective Scale Invariant Feature Transform Correspondences

Wei Chen,Luming Liang,Yuelong Zhao,Shu Chen
DOI: https://doi.org/10.1117/1.jei.26.3.033022
IF: 0.829
2017-01-01
Journal of Electronic Imaging
Abstract:Reconstructing three-dimensional (3-D) poses from matched feature correspondences is widely used in 3-D object tracking. The precision of correspondence matching plays a major role in the pose reconstruction. Without prior knowledge of the perspective camera model, state-of-the-art methods only deal with two-dimensional (2-D) planar affine transforms. An interest point's detector and descriptor [perspective scale invariant feature transform (SIFT)] is proposed to overcome the side effects of viewpoint changing, i.e., our detector is invariant to viewpoint changing. Perspective SIFT is detected by the SIFT approach, where the sample region is determined by projecting the original sample region to the image plane based on the established camera model. An iterative algorithm then modifies the pose of the tracked object and it generally converges to a 3-D perspective invariant point. The pose of the tracked object is finally estimated by the combination of template warping and perspective SIFT correspondences. Thorough evaluations are performed on two public databases, the Biwi Head Pose dataset and the Boston University dataset. Comparisons illustrate that the proposed keypoint's detector largely improves the tracking performance. (C) 2017 SPIE and IS&T
What problem does this paper attempt to address?