Abstract:In this paper, we propose a weakly-supervised approach for 3D object detection, which makes it possible to train a strong 3D detector with position-level annotations (i.e. annotations of object centers and categories). In order to remedy the information loss from box annotations to centers, our method makes use of synthetic 3D shapes to convert the position-level annotations into virtual scenes with box-level annotations, and in turn utilizes the fully-annotated virtual scenes to complement the real labels. Specifically, we first present a shape-guided label-enhancement method, which assembles 3D shapes into physically reasonable virtual scenes according to the coarse scene layout extracted from position-level annotations. Then we transfer the information contained in the virtual scenes back to real ones by applying a virtual-to-real domain adaptation method, which refines the annotated object centers and additionally supervises the training of detector with the virtual scenes. Since the shape-guided label enhancement method generates virtual scenes by human-heuristic physical constraints, the layout of the fixed virtual scenes may be unreasonable with varied object combinations. To address this, we further present differentiable label enhancement to optimize the virtual scenes including object scales, orientations and locations in a data-driven manner. Moreover, we further propose a label-assisted self-training strategy to fully exploit the capability of detector. By reusing the position-level annotations and virtual scenes, we fuse the information from both domains and generate box-level pseudo labels on the real scenes, which enables us to directly train a detector in fully-supervised manner. Extensive experiments on the widely used ScanNet and Matterport3D datasets show that our approach surpasses current weakly-supervised and semi-supervised methods by a large margin, and achieves comparable detection performance with some popular fully-supervised methods with less than 5% of the labeling labor.

SS3D: Sparsely-Supervised 3D Object Detection from Point Cloud

Are Dense Labels Always Necessary for 3D Object Detection from Point Cloud?

3D-SSD: Learning Hierarchical Features from RGB-D Images for Amodal 3D Object Detection

Weakly Supervised 3D Object Detection from Point Clouds

SSC3OD: Sparsely Supervised Collaborative 3D Object Detection from LiDAR Point Clouds

Semi-Supervised 3d Object Detection Via Adaptive Pseudo-Labeling

Towards A Weakly Supervised Framework for 3D Point Cloud Object Detection and Annotation

A weakly supervised method for 3D object detection with partially annotated samples

Back to Reality: Learning Data-Efficient 3D Object Detector with Shape Guidance.

Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection

Hierarchical Supervision and Shuffle Data Augmentation for 3D Semi-Supervised Object Detection

FS-3DSSN: an efficient few-shot learning for single-stage 3D object detection on point clouds

Weakly Supervised 3D Instance Segmentation without Instance-level Annotations

DQS3D: Densely-matched Quantization-aware Semi-supervised 3D Detection

VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection

ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object Detection

ST3D++: Denoised Self-Training for Unsupervised Domain Adaptation on 3D Object Detection

ATF-3D: Semi-Supervised 3D Object Detection With Adaptive Thresholds Filtering Based on Confidence and Distance

SC3D: Label-Efficient Outdoor 3D Object Detection via Single Click Annotation

A CONCEPTUAL STUDY INTO THE POTENTIAL OF MAX-PHASE CERAMICS FOR SELF-HEALING OF CRACK DAMAGE

3DSSD: Point-based 3D Single Stage Object Detector