Weakly supervised semantic segmentation of mobile laser scanning point clouds via category balanced random annotation and deep consistency‐guided self‐distillation mechanism
Jiacheng Liu,Haiyan Guan,Xiangda Lei,Yongtao Yu
DOI: https://doi.org/10.1111/phor.12468
2023-12-03
The Photogrammetric Record
Abstract:This paper explores a novel weakly supervised learning (WSL) framework for semantic segmentation of mobile laser scanning (MLS) point clouds. Specifically, a category balanced random annotation (CBRA) strategy is employed to obtain balanced labels and enhance model performance. Next, a deep consistency‐guided self‐distillation (DCS) mechanism is developed by transferring knowledge among different deep networks to exploit semantically rich information. By using only 0.1% labelled points, the proposed WSL framework achieved a competitive performance on three different point cloud datasets. Scene understanding of mobile laser scanning (MLS) point clouds is vital in autonomous driving and virtual reality. Most existing semantic segmentation methods rely on a large number of accurately labelled points, which is time‐consuming and labour‐intensive. To cope with this issue, this paper explores a weakly supervised learning (WSL) framework for MLS data. Specifically, a category balanced random annotation (CBRA) strategy is employed to obtain balanced labels and enhance model performance. Next, based on KPConv‐Net as a backbone network, a WSL semantic segmentation framework is developed for MLS point clouds via a deep consistency‐guided self‐distillation (DCS) mechanism. The DCS mechanism consists of a deep consistency‐guided self‐distillation branch and an entropy regularisation branch. The self‐distillation branch is designed by constructing an auxiliary network to maintain the consistency of predicted distributions between the auxiliary network and the original network, while the entropy regularisation branch is designed to increase the confidence of the network predicted results. The proposed WSL framework was evaluated on the WHU‐MLS, NPM3D and Toronto3D datasets. By using only 0.1% labelled points, the proposed WSL framework achieved a competitive performance in MLS point cloud semantic segmentation with the mean Intersection over Union (mIoU) scores of 60.08%, 72.0% and 67.42% on the three datasets, respectively.
geosciences, multidisciplinary,geography, physical,remote sensing,imaging science & photographic technology