Abstract:In this work, we address the challenging task of few-shot and zero-shot 3D point cloud semantic segmentation. The success of few-shot semantic segmentation in 2D computer vision is mainly driven by the pre-training on large-scale datasets like imagenet. The feature extractor pre-trained on large-scale 2D datasets greatly helps the 2D few-shot learning. However, the development of 3D deep learning is hindered by the limited volume and instance modality of datasets due to the significant cost of 3D data collection and annotation. This results in less representative features and large intra-class feature variation for few-shot 3D point cloud segmentation. As a consequence, directly extending existing popular prototypical methods of 2D few-shot classification/segmentation into 3D point cloud segmentation won't work as well as in 2D domain. To address this issue, we propose a Query-Guided Prototype Adaption (QGPA) module to adapt the prototype from support point clouds feature space to query point clouds feature space. With such prototype adaption, we greatly alleviate the issue of large feature intra-class variation in point cloud and significantly improve the performance of few-shot 3D segmentation. Besides, to enhance the representation of prototypes, we introduce a Self-Reconstruction (SR) module that enables prototype to reconstruct the support mask as well as possible. Moreover, we further consider zero-shot 3D point cloud semantic segmentation where there is no support sample. To this end, we introduce category words as semantic information and propose a semantic-visual projection model to bridge the semantic and visual spaces. Our proposed method surpasses state-of-the-art algorithms by a considerable 7.90% and 14.82% under the 2-way 1-shot setting on S3DIS and ScanNet benchmarks, respectively.

Prototype Adaption and Projection for Few- and Zero-Shot 3D Point Cloud Semantic Segmentation

Dynamic Prototype Adaptation with Distillation for Few-shot Point Cloud Segmentation

Query-guided Support Prototypes for Few-shot 3D Indoor Segmentation

Pass3d: Precise And Accelerated Semantic Segmentation For 3d Point Cloud

Few-shot 3D Point Cloud Semantic Segmentation

Superpoint-guided Semi-supervised Semantic Segmentation of 3D Point Clouds

Learning from the Target: Dual Prototype Network for Few Shot Semantic Segmentation

APANet: Adaptive Prototypes Alignment Network for Few-Shot Semantic Segmentation

Boosting Few-shot 3D Point Cloud Segmentation via Query-Guided Enhancement

Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation

Affinity3D: Propagating Instance-Level Semantic Affinity for Zero-Shot Point Cloud Semantic Segmentation

Rethinking Few-shot 3D Point Cloud Semantic Segmentation

Generalized Few-Shot Point Cloud Segmentation Via Geometric Words

Bidirectional Feature Globalization for Few-shot Semantic Segmentation of 3D Point Cloud Scenes

CLIP-Driven Prototype Network for Few-Shot Semantic Segmentation

Self-Regularized Prototypical Network for Few-Shot Semantic Segmentation

Prototype-based Semantic Segmentation

3D Object Segmentation Using Cross-Window Point Transformer with Latent Semantic Boundary Guidance

Adaptive Prototype Learning and Allocation for Few-Shot Segmentation

Beyond singular prototype: A prototype splitting strategy for few-shot medical image segmentation

Prototype Mixture Models for Few-shot Semantic Segmentation