CLR-GAM: Contrastive Point Cloud Learning with Guided Augmentation and Feature Mapping

Srikanth Malla,Yi-Ting Chen
DOI: https://doi.org/10.48550/arXiv.2302.14306
2023-02-28
Abstract:Point cloud data plays an essential role in robotics and self-driving applications. Yet, annotating point cloud data is time-consuming and nontrivial while they enable learning discriminative 3D representations that empower downstream tasks, such as classification and segmentation. Recently, contrastive learning-based frameworks have shown promising results for learning 3D representations in a self-supervised manner. However, existing contrastive learning methods cannot precisely encode and associate structural features and search the higher dimensional augmentation space efficiently. In this paper, we present CLR-GAM, a novel contrastive learning-based framework with Guided Augmentation (GA) for efficient dynamic exploration strategy and Guided Feature Mapping (GFM) for similar structural feature association between augmented point clouds. We empirically demonstrate that the proposed approach achieves state-of-the-art performance on both simulated and real-world 3D point cloud datasets for three different downstream tasks, i.e., 3D point cloud classification, few-shot learning, and object part segmentation.
Computer Vision and Pattern Recognition,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
The main problems that this paper attempts to solve are two limitations of existing contrastive learning methods when dealing with 3D point cloud data: 1. **Contrast Ambiguity**: - **Global Contrast Ambiguity (GCA)**: Through augmentation operations such as cropping and non - rigid transformation, the shape of the augmented object may be completely different from the original object, leading to confusion in contrastive learning. For example, if the back part is removed from a "chair" point cloud, the result may be similar to a sample in the "table" category. - **Local Contrast Ambiguity (LCA)**: Local feature contrast techniques regard the features of other points in the same point cloud as negative samples, but due to symmetry and similar shapes within the object, these features may actually be similar. 2. **Curse of Dimensionality**: - Contrastive learning requires multiple augmentation operations to learn discriminative 3D point cloud representations, but exploring the high - dimensional augmentation space is time - consuming and it is difficult to ensure coverage. Existing methods usually adopt random sampling, which leads to ineffective augmentation and high computational cost. To solve these problems, the authors propose a new contrastive learning framework **CLR - GAM**, which contains two main modules: 1. **Guided Feature Mapping (GFM)**: - It is used to correlate the features of the same structure between two augmented point clouds, and can effectively correlate features even in the case of severely deformed shapes. - By tightly coupling local feature correlation and global feature contrast, it improves the accuracy of feature representation. 2. **Guided Augmentation (GA)**: - It efficiently explores the high - dimensional augmentation space through a dynamic exploration strategy, ensuring that a diverse range of augmentation operations can be covered with a limited number of samples. - It uses a memory bank to save the explored augmented samples and selects novel augmentation operations by calculating the distance between new samples and existing samples. Through these two modules, CLR - GAM can more effectively learn discriminative 3D point cloud representations and has achieved state - of - the - art performance in multiple downstream tasks, including 3D point cloud classification, few - shot learning, and object part segmentation.