Semantic Labeling of ALS Point Cloud Via Learning Voxel and Pixel Representations

Nannan Qin,Xiangyun Hu,Puzuo Wang,Jie Shan,Yijing Li
DOI: https://doi.org/10.1109/lgrs.2019.2931119
IF: 5.343
2020-01-01
IEEE Geoscience and Remote Sensing Letters
Abstract:Semantic labeling is a fundamental task that can provide useful semantics for many other 3-D processing tasks. To tackle the challenge of airborne laser scanning (ALS) point cloud classification, current state-of-the-art methods leverage the capabilities of deep learning. However, they are limited due to the weaknesses of the isolated use of individual representations of point clouds. To address this issue, this letter presents a novel network, VPNet, which ensembles voxel and pixel representation-based networks, to predict class probabilities for each light detection and ranging (LiDAR) point. A fully connected conditional random field-based global refinement is then performed over each point in the point cloud to produce a fine-grained classification result. On the ISPRS 3-D Semantic Labeling Contest, our solution sets a new state of the art by improving the highest average F1-score and the highest average per-class accuracy from 69.3% to 73.9%, and 69.0% to 74.9%, respectively. The overall accuracy of our approach is 84.0%.
What problem does this paper attempt to address?