Abstract:In the field of point cloud representation learning, many self-supervised learning methods aim to address the issue of conventional supervised learning methods relying heavily on labeled data. Particularly in recent years, contrastive learning-based methods have gained an increasing popularity. However, most of the current contrastive learning methods solely rely on conventional random augmentation, limiting the effectiveness of representation learning. Moreover, to prevent model collapse, they construct positive and negative sample pairs or explicit clustering centers, which adds complexity to data preprocessing operations. To address these challenges effectively and achieve accurate point cloud classification and segmentation, we propose PointSL, a self-learning network for point clouds based on contrastive learning. PointSL incorporates a learnable point cloud augmentation (LPA) module, which transforms samples with high precision, significantly improving the augmentation effect. To further enhance feature discrimination, PointSL introduces a self-learning process along a refined feature predictor (FFP). This innovative approach leverages the attention mechanism to facilitate mutual feature prediction between pairs of point clouds, thereby continuously improving discriminant performance. Additionally, the network constructed a simple yet effective self-adaptive loss function that optimizes the entire network through gradient feedback. For pretraining, it is beneficial to obtain encoders with a better generalization and a higher accuracy. We evaluate PointSL on benchmark datasets such as ModelNet40, Sydney Urban Objects and ShapeNetPart. Experimental results demonstrate that PointSL outperforms state-of-the-art self-supervised methods and supervised counterparts, achieving exceptional performance in classification and segmentation tasks. Notably, on the Sydney Urban Objects and ModelNet40 datasets, PointSL achieves OA and AA metrics of 80.6%, 69.9%, 94.2% and 91.4%, respectively. On the ShapeNetPart dataset, PointSL achieves Inst.mIoU and Cls.mIoU metrics of 86.3% and 85.1%, respectively.

PointUR-RL: Unified Self-Supervised Learning Method Based on Variable Masked Autoencoder for Point Cloud Reconstruction and Representation Learning

Masked Autoencoders for Point Cloud Self-supervised Learning.

Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos

Point Cloud Domain Adaptation Via Masked Local 3D Structure Prediction

Multi-Angle Point Cloud-VAE: Unsupervised Feature Learning for 3D Point Clouds from Multiple Angles by Joint Self-Reconstruction and Half-to-Half Prediction

Point‐AGM : Attention Guided Masked Auto‐Encoder for Joint Self‐supervised Learning on Point Clouds

Point Cloud Self-supervised Learning via 3D to Multi-view Masked Autoencoder

RI-MAE: Rotation-Invariant Masked AutoEncoders for Self-Supervised Point Cloud Representation Learning

LR-MAE: Locate While Reconstructing with Masked Autoencoders for Point Cloud Self-supervised Learning

Masked Autoencoders in 3D Point Cloud Representation Learning

Inter-Modal Masked Autoencoder for Self-Supervised Learning on Point Clouds

Upsampling Autoencoder for Self-Supervised Point Cloud Learning

Regress Before Construct: Regress Autoencoder for Point Cloud Self-supervised Learning

Bringing Masked Autoencoders Explicit Contrastive Properties for Point Cloud Self-Supervised Learning

Self-supervised Point Cloud Representation Learning Via Separating Mixed Shapes

PointMoment:Mixed-Moment-based Self-Supervised Representation Learning for 3D Point Clouds

PointCG: Self-supervised Point Cloud Learning via Joint Completion and Generation

Masked Local-Global Representation Learning for 3D Point Cloud Domain Adaptation

A point cloud self-learning network based on contrastive learning for classification and segmentation

A Simple Masked Autoencoder Paradigm for Point Cloud

GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Training