Abstract:In the field of point cloud representation learning, many self-supervised learning methods aim to address the issue of conventional supervised learning methods relying heavily on labeled data. Particularly in recent years, contrastive learning-based methods have gained an increasing popularity. However, most of the current contrastive learning methods solely rely on conventional random augmentation, limiting the effectiveness of representation learning. Moreover, to prevent model collapse, they construct positive and negative sample pairs or explicit clustering centers, which adds complexity to data preprocessing operations. To address these challenges effectively and achieve accurate point cloud classification and segmentation, we propose PointSL, a self-learning network for point clouds based on contrastive learning. PointSL incorporates a learnable point cloud augmentation (LPA) module, which transforms samples with high precision, significantly improving the augmentation effect. To further enhance feature discrimination, PointSL introduces a self-learning process along a refined feature predictor (FFP). This innovative approach leverages the attention mechanism to facilitate mutual feature prediction between pairs of point clouds, thereby continuously improving discriminant performance. Additionally, the network constructed a simple yet effective self-adaptive loss function that optimizes the entire network through gradient feedback. For pretraining, it is beneficial to obtain encoders with a better generalization and a higher accuracy. We evaluate PointSL on benchmark datasets such as ModelNet40, Sydney Urban Objects and ShapeNetPart. Experimental results demonstrate that PointSL outperforms state-of-the-art self-supervised methods and supervised counterparts, achieving exceptional performance in classification and segmentation tasks. Notably, on the Sydney Urban Objects and ModelNet40 datasets, PointSL achieves OA and AA metrics of 80.6%, 69.9%, 94.2% and 91.4%, respectively. On the ShapeNetPart dataset, PointSL achieves Inst.mIoU and Cls.mIoU metrics of 86.3% and 85.1%, respectively.

Learning Transformation-Predictive Representations for Detection and Description of Local Features.

Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos

Leveraging Local Planar Motion Property for Robust Visual Matching and Localization.

Self-Supervised Global-Local Structure Modeling for Point Cloud Domain Adaptation with Reliable Voted Pseudo Labels

Graph-Based Contrastive Learning for Description and Detection of Local Features.

Local Representation is Not Enough: Soft Point-Wise Transformer for Descriptor and Detector of Local Features.

Rethinking Low-Level Features for Interest Point Detection and Description.

Self-Contrastive Learning with Hard Negative Sampling for Self-supervised Point Cloud Learning

P4Contrast: Contrastive Learning with Pairs of Point-Pixel Pairs for RGB-D Scene Understanding

Digging Into Self-Supervised Learning of Feature Descriptors

Learning multi-view visual correspondences with self-supervision

Learning Task-Aligned Local Features for Visual Localization

Improving Contrastive Learning by Visualizing Feature Transformation

Distillation with Contrast is All You Need for Self-Supervised Point Cloud Representation Learning

A point cloud self-learning network based on contrastive learning for classification and segmentation

PointCMP: Contrastive Mask Prediction for Self-supervised Learning on Point Cloud Videos

GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding

Self-supervised Latent Feature Learning for Partial Point Clouds Recognition

Deep Self-Taught Learning for Weakly Supervised Object Localization

Reinforced Feature Points: Optimizing Feature Detection and Description for a High-Level Task

PUW-Feat: A Progressive and Unified Method for Weakly Supervised Local Feature Learning.