BSTS: A Weakly-Supervised Method for Semantic Learning of 3D Point Clouds

Yan Liu,Qingyong Hu,Yulan Guo
DOI: https://doi.org/10.1109/tcsvt.2024.3420150
IF: 5.859
2024-01-01
IEEE Transactions on Circuits and Systems for Video Technology
Abstract:Point cloud semantic understanding with fewer point-wise annotations is an ongoing challenge that has yet to be fully addressed in the literature. Although previous approaches have achieved some success with weak supervision, our research reveals that even basic bounding box annotations and subcloud-level tags can provide valuable information for point cloud semantic segmentation. We propose a framework using Bounding boxes and Subcloud-level Tags for Semantic Segmentation, named BSTS. Our method explores local topological structures and geometric priors within and outside bounding boxes to produce reliable pseudo labels. Once bounding boxes of instances are provided for a point cloud, raw points can be divided into three categories: potential foreground points, ambiguous points, and clear background points. To ensure the reliability of the pseudo labels derived from weak supervision, we utilized an Attention-based Self-Training (AST) pipeline and the Point Class Activation Maps (PCAMs) technique. Subsequently, the segmentation network is trained using the generated pseudo labels. Experiments are conducted on two widely used large-scale benchmarks, including S3DIS and ScanNet. Our method achieves competitive semantic performance with the fully-supervised counterpart via low-cost bounding box annotations and subcloud-level tags.
What problem does this paper attempt to address?