TG-Pose: Delving Into Topology and Geometry for Category-Level Object Pose Estimation
Yue Zhan, Xin Wang, Lang Nie, Yang Zhao, Tangwen Yang, Qiuqi Ruan
DOI: https://doi.org/10.1109/tmm.2024.3398291
IF: 7.3
2024-09-21
IEEE Transactions on Multimedia
Abstract: Category-level 6D object pose estimation aims to estimate the pose and size of unseen objects from known categories. Existing methods mainly focus on capturing geometric features to handle shape variations and are prone to failure under occlusion and in noisy environments. In this paper, we propose TG-Pose, a unified pose estimation framework that delves into topology and geometry to address these issues. To exploit topological properties, we first propose a topological feature predictor and a topological label generator to extract the underlying structural details from encoded features using persistent homology. The topological and geometric features are then used to facilitate symmetry reconstruction of the original point cloud, yielding a reliable and coherent object shape that in turn guides pose estimation. For each object category, we construct geometric and topological templates by leveraging inherent intra-class similarities. These templates enhance the reliability of pose estimation and the completeness of object structure through geometric alignment and topological guidance, especially when handling incomplete objects. Moreover, a pose-aware enhancement strategy is designed to help the encoder learn pose-sensitive features and improve its robustness to noisy point clouds. Experimental results show that TG-Pose outperforms state-of-the-art solutions on public benchmarks and achieves better generalization on real-world datasets.
computer science, information systems,telecommunications, software engineering
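
Illustrative note: the abstract states that the topological labels are derived with persistent homology but gives no implementation details. As a rough illustration of the concept only, and not the TG-Pose label generator, the sketch below computes the 0-dimensional persistence (the scales at which connected components merge) of a Vietoris-Rips filtration on a small point cloud. The function name `h0_persistence` and the toy data are assumptions made for this example.

```python
import math
from itertools import combinations


def h0_persistence(points):
    """Return the finite death scales of 0-dimensional features (connected
    components) in a Vietoris-Rips filtration of `points`.

    Every point is born at scale 0. A component dies at the length of the
    first edge that merges it into another component. One component
    survives forever and is not reported.
    """
    n = len(points)
    parent = list(range(n))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # All pairwise edges, sorted by Euclidean length (the filtration order).
    edges = sorted(
        (math.dist(points[i], points[j]), i, j)
        for i, j in combinations(range(n), 2)
    )

    deaths = []
    for length, i, j in edges:
        ri, rj = find(i), find(j)
        if ri != rj:  # this edge merges two separate components
            parent[ri] = rj
            deaths.append(length)
    return deaths


# Hypothetical toy point cloud: two tight clusters that are far apart.
cloud = [
    (0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0),
    (10.0, 0.0, 0.0), (11.0, 0.0, 0.0),
]
deaths = h0_persistence(cloud)
print(deaths)
```

In this toy case the short-lived components correspond to merges within each cluster, while the single long-lived one reflects the gap between the two clusters. That kind of structural signal is what persistence summaries expose. Higher-dimensional features such as loops would require a dedicated persistent homology library.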