Three-Filters-to-Normal$+$: Revisiting Discontinuity Discrimination in Depth-to-Normal Translation
Jingwei Yang, Bohuan Xue, Yi Feng, Deming Wang, Rui Fan, Qijun Chen
DOI: https://doi.org/10.1109/tase.2024.3355941
IF: 6.636
2024-01-01
IEEE Transactions on Automation Science and Engineering
Abstract: This article introduces three-filters-to-normal$+$ (3F2N$+$), an extension of our previous work three-filters-to-normal (3F2N), with a specific focus on incorporating discontinuity discrimination capability into surface normal estimators (SNEs). 3F2N$+$ achieves this capability through a novel discontinuity discrimination module (DDM), which combines depth curvature minimization and correlation coefficient maximization through conditional random fields (CRFs). To evaluate the robustness of SNEs on noisy data, we create a large-scale synthetic surface normal (SSN) dataset containing 20 scenarios (ten indoor scenarios and ten outdoor scenarios, with and without random Gaussian noise added to depth images). Extensive experiments demonstrate that 3F2N$+$ outperforms all other geometry-based surface normal estimators, with average angular errors of 7.85$^\circ$, 8.95$^\circ$, 9.25$^\circ$, and 11.98$^\circ$ on the clean-indoor, clean-outdoor, noisy-indoor, and noisy-outdoor datasets, respectively. We conduct three additional experiments to demonstrate the effectiveness of incorporating our proposed 3F2N$+$ into downstream robot perception tasks, including freespace detection, 6D object pose estimation, and point cloud completion. Our source code and datasets are publicly available at https://mias.group/3F2Nplus.
Note to Practitioners: The primary motivation behind this work arises from the need to develop a high-performing surface normal estimator for practical robotics and computer vision applications. While geometry-based surface normal estimators have been widely used in these domains, the existing solutions pay little attention to discontinuity discrimination. To tackle this problem, this article introduces a plug-and-play module that leverages both depth curvature and correlation coefficient to quantify discontinuity levels, thereby optimizing surface normal estimation, particularly near or on discontinuous regions. This article also introduces a large-scale public dataset with random noise added to depth images, providing a more realistic and robust platform for algorithm evaluation within this research community. Extensive experimental results demonstrate that our method outperforms other state-of-the-art algorithms.
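The average angular errors quoted in the abstract are given in degrees. As a rough illustration of how a metric of this kind is commonly computed, the sketch below averages the angle between estimated and ground-truth normal vectors. The function names and the toy inputs are hypothetical and are not taken from the authors' released code; the paper's actual evaluation protocol (for example, how invalid pixels are masked) may differ.

```python
import math


def angular_error_deg(n_est, n_gt):
    """Angle in degrees between two 3D vectors (normalized internally)."""
    dot = sum(a * b for a, b in zip(n_est, n_gt))
    norm_est = math.sqrt(sum(a * a for a in n_est))
    norm_gt = math.sqrt(sum(b * b for b in n_gt))
    cos_angle = dot / (norm_est * norm_gt)
    # Clamp to [-1, 1] so rounding error cannot push acos out of its domain.
    cos_angle = max(-1.0, min(1.0, cos_angle))
    return math.degrees(math.acos(cos_angle))


def average_angular_error_deg(normals_est, normals_gt):
    """Mean per-pixel angular error over paired lists of normals."""
    errors = [angular_error_deg(e, g) for e, g in zip(normals_est, normals_gt)]
    return sum(errors) / len(errors)


# Toy input (assumed for illustration): one exact match, one 90-degree miss.
estimated = [(0.0, 0.0, 1.0), (1.0, 0.0, 0.0)]
ground_truth = [(0.0, 0.0, 1.0), (0.0, 1.0, 0.0)]
mean_error = average_angular_error_deg(estimated, ground_truth)
```

In practice the same computation would run over every valid pixel of a normal map, typically in vectorized form rather than with Python lists.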
automation & control systems