CP-Net: Contour-Perturbed Reconstruction Network for Self-Supervised Point Cloud Learning

Mingye Xu,Yali Wang,Zhipeng Zhou,Hongbin Xu,Yu Qiao
DOI: https://doi.org/10.48550/arXiv.2201.08215
2022-03-28
Abstract:Self-supervised learning has not been fully explored for point cloud analysis. Current frameworks are mainly based on point cloud reconstruction. Given only 3D coordinates, such approaches tend to learn local geometric structures and contours, while failing in understanding high level semantic content. Consequently, they achieve unsatisfactory performance in downstream tasks such as classification, segmentation, etc. To fill this gap, we propose a generic Contour-Perturbed Reconstruction Network (CP-Net), which can effectively guide self-supervised reconstruction to learn semantic content in the point cloud, and thus promote discriminative power of point cloud representation. First, we introduce a concise contour-perturbed augmentation module for point cloud reconstruction. With guidance of geometry disentangling, we divide point cloud into contour and content components. Subsequently, we perturb the contour components and preserve the content components on the point cloud. As a result, self supervisor can effectively focus on semantic content, by reconstructing the original point cloud from such perturbed one. Second, we use this perturbed reconstruction as an assistant branch, to guide the learning of basic reconstruction branch via a distinct dual-branch consistency loss. In this case, our CP-Net not only captures structural contour but also learn semantic content for discriminative downstream tasks. Finally, we perform extensive experiments on a number of point cloud benchmarks. Part segmentation results demonstrate that our CP-Net (81.5% of mIoU) outperforms the previous self-supervised models, and narrows the gap with the fully-supervised methods. For classification, we get a competitive result with the fully-supervised methods on ModelNet40 (92.5% accuracy) and ScanObjectNN (87.9% accuracy). The codes and models will be released afterwards.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is that in point - cloud analysis, self - supervised learning methods perform poorly in understanding and differentiating high - level semantic content in point clouds. Existing self - supervised learning frameworks are mainly based on point - cloud reconstruction. These methods tend to only learn local geometric structures and contours, and are unable to understand high - level semantic content, resulting in unsatisfactory performance in downstream tasks such as classification and segmentation. Therefore, the paper proposes a general Contour - Perturbed Reconstruction Network (CP - Net), aiming to learn the semantic content in point clouds by guiding self - supervised reconstruction, thereby improving the discriminative ability of point - cloud representations. Specifically, CP - Net conducts point - cloud reconstruction by introducing a concise contour - perturbation enhancement module. This module first divides the point cloud into contour and content components, then perturbs the contour component and retains the content component. In this way, the self - supervisor can focus more effectively on the learning of semantic content. In addition, the paper also designs a two - branch consistency loss, using an auxiliary branch to guide the learning of the basic branch, so that CP - Net can not only capture structural contours, but also learn semantic content for discriminative downstream tasks. Experimental results show that CP - Net has achieved significant performance improvements in partial segmentation and classification tasks on multiple point - cloud benchmark datasets.