Abstract: In point cloud representation learning, many self-supervised methods aim to reduce the heavy reliance of conventional supervised methods on labeled data. In recent years, contrastive learning-based methods have become increasingly popular. However, most current contrastive learning methods rely solely on conventional random augmentation, which limits the effectiveness of representation learning. Moreover, to prevent model collapse, they construct positive and negative sample pairs or explicit clustering centers, which adds complexity to data preprocessing. To address these challenges and achieve accurate point cloud classification and segmentation, we propose PointSL, a self-learning network for point clouds based on contrastive learning. PointSL incorporates a learnable point cloud augmentation (LPA) module, which transforms samples with high precision and significantly improves the augmentation effect. To further enhance feature discrimination, PointSL introduces a self-learning process along with a refined feature predictor (FFP). This approach uses an attention mechanism to let the two point clouds in a pair predict each other's features, thereby continuously improving discriminative performance. Additionally, the network constructs a simple yet effective self-adaptive loss function that optimizes the entire network through gradient feedback. This benefits pretraining, yielding encoders with better generalization and higher accuracy. We evaluate PointSL on benchmark datasets including ModelNet40, Sydney Urban Objects, and ShapeNetPart. Experimental results demonstrate that PointSL outperforms state-of-the-art self-supervised methods and supervised counterparts in classification and segmentation tasks.
Notably, PointSL achieves an overall accuracy (OA) of 80.6% and an average accuracy (AA) of 69.9% on the Sydney Urban Objects dataset, and an OA of 94.2% and an AA of 91.4% on ModelNet40. On the ShapeNetPart dataset, PointSL achieves an instance mIoU (Inst.mIoU) of 86.3% and a class mIoU (Cls.mIoU) of 85.1%.
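
The two classification metrics can differ noticeably on class-imbalanced data, which may explain the gap between OA and AA reported on Sydney Urban Objects. The sketch below is not taken from the paper: it assumes that OA is the fraction of correctly classified samples and that AA is the unweighted mean of per-class accuracies, which is the common convention but is not defined in the abstract. The function name `overall_and_average_accuracy` and the toy labels are hypothetical.

```python
from collections import defaultdict


def overall_and_average_accuracy(y_true, y_pred):
    """Return (OA, AA) for two equal-length label sequences.

    Assumed definitions (not stated in the abstract):
      OA = correct predictions / all samples
      AA = unweighted mean of per-class accuracies
    """
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("label sequences must be non-empty and equal in length")

    correct = 0
    class_total = defaultdict(int)    # samples per ground-truth class
    class_correct = defaultdict(int)  # correct predictions per ground-truth class

    for t, p in zip(y_true, y_pred):
        class_total[t] += 1
        if t == p:
            correct += 1
            class_correct[t] += 1

    oa = correct / len(y_true)
    aa = sum(class_correct[c] / n for c, n in class_total.items()) / len(class_total)
    return oa, aa


# Toy, imbalanced example: class 0 dominates and is always predicted correctly.
y_true = [0, 0, 0, 0, 1, 1, 2, 2]
y_pred = [0, 0, 0, 0, 1, 0, 2, 1]
oa, aa = overall_and_average_accuracy(y_true, y_pred)
print(f"OA = {oa:.3f}, AA = {aa:.3f}")
```

In this toy case the dominant class lifts OA above AA, because AA weights every class equally regardless of how many samples it contains.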

LPCL: Localized Prominence Contrastive Learning for Self-Supervised Dense Visual Pre-Training

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

DenseCL: A Simple Framework for Self-Supervised Dense Visual Pre-Training

Multi-Level Contrastive Learning for Dense Prediction Task

Align Yourself: Self-supervised Pre-training for Fine-grained Recognition via Saliency Alignment

Space-correlated Contrastive Representation Learning with Multiple Instances

MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining

Toward High Quality Facial Representation Learning

Mejigclu: more effective jigsaw clustering for unsupervised visual representation learning

Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations

Dense Contrastive Visual-Linguistic Pretraining

HAPiCLR: heuristic attention pixel-level contrastive loss representation learning for self-supervised pretraining

Boost Supervised Pretraining for Visual Transfer Learning: Implications of Self-Supervised Contrastive Representation Learning

A point cloud self-learning network based on contrastive learning for classification and segmentation

Dense Semantic Contrast for Self-Supervised Visual Representation Learning

A Novel Self-Learning Network Integrating Contrastive Learning, Perceptual Learning and Masked Image Modelling

Contrastive Localized Language-Image Pre-Training

ADCL: Adversarial Distilled Contrastive Learning on lightweight models for self-supervised image classification

Locality Preserving Property Constrained Contrastive Learning for Object Classification in SAR Imagery

Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning

Self-supervised Cross-stage Regional Contrastive Learning for Object Detection