A Dual Attention KPConv Network Combined With Attention Gates for Semantic Segmentation of ALS Point Clouds

Jinbiao Zhao,Hangyu Zhou,Feifei Pan
DOI: https://doi.org/10.1109/tgrs.2024.3422829
IF: 8.2
2024-07-13
IEEE Transactions on Geoscience and Remote Sensing
Abstract:Kernel point convolution (KPConv) defines convolutional weights based on Euclidean distances between kernel points and input points and has shown good segmentation results on several datasets. However, it does not consider the intrinsic connection between input points and features, which is crucial for the semantic segmentation of airborne laser scanning (ALS) point clouds with sparse density and complex backgrounds. To address this problem, we design a dual attention KPConv network (DAKAG-Net) combined with attention gates for semantic segmentation of ALS point clouds. Specifically, we design the channel and spatial attention KPConv (CSAKPConv) block in the encoding process, which first performs adaptive feature refinement of the input mapping along two separate dimensions, channel and spatial, and then performs kernel point convolution. In addition, to enhance the use of high-level semantic information and detect objects of varying sizes, DAKAG-Net incorporates multiple attention gates (MAGs) that merge the lowest-level features, skip-connected features, and corresponding upsampled features during the decoding process. The decoded features are ultimately convolved with convolution kernels of various sizes and then merged to acquire multiscale perceptual field features. The proposed DAKAG-Net improves the OA, mF1, and mIoU by 3.5%, 3.1%, and 3.5%, respectively, compared with the baseline results on the ISPRS 3-D dataset, and yields the segmentation accuracy rates of 85.2% (OA), 73.7% (mF1), and 61.2% (mIoU). Moreover, the DAKAG-Net also obtains new state-of-the-art segmentation results on the DFC2019 dataset and the LASDU dataset.
engineering, electrical & electronic,imaging science & photographic technology,remote sensing,geochemistry & geophysics
What problem does this paper attempt to address?