DSHFNet: Dynamic Scale Hierarchical Fusion Network Based on Multiattention for Hyperspectral Image and LiDAR Data Classification

Yining Feng,Liyang Song,Lu Wang,Xianghai Wang
DOI: https://doi.org/10.1109/tgrs.2023.3311535
IF: 8.2
2023-01-01
IEEE Transactions on Geoscience and Remote Sensing
Abstract:With the continuous improvement of satellite sensor performance, it is becoming easier to obtain different types of remote sensing (RS) data from multiple sensors, and the fusion of hyperspectral (HS) images and light detection and ranging (LiDAR) for land use/land cover (LULC) classification has become a research hotspot. However, the current mainstream methods still have defects in feature extraction and feature fusion. In the feature extraction stage, previous methods usually use a single-scale patch as input and a fixed convolution kernel for feature extraction, which makes it difficult to extract features in line with different land cover types at the same time and to obtain high-quality features. Although multiscale feature extraction can solve the one-sidedness problem of single-scale features, it also brings the challenge of high-dimensional multiscale features. In the feature fusion stage, the current fusion methods are relatively simple. Therefore, we propose a dynamic scale hierarchical fusion network (DSHFNet) for fusion classification of HS images and LiDAR data. By calculating the similarity in the scale space and judging the information at different scales through the threshold value, the appropriate scale features are dynamically selected, the small-scale features are integrated into the large-scale features, and the dimensionality of the features is reduced. This method solves the unreliability problem of single-scale features and the high-dimensional problem of multiscale features. In the feature fusion process, different attention modules are used for hierarchical fusion, spatial attention modules are used for shallow fusion and joint feature extraction, and modal attention modules are used for deep fusion of joint features and features from different sensors to achieve complete complementarity of features. Experimental evaluations on three real RS datasets demonstrate the superiority of the proposed method compared with existing methods. The source code can be downloaded at https://github.com/SYFYN0317/DSHFNet .
What problem does this paper attempt to address?