SABNet: Self-Attention Bilateral Network for Land Cover Classification

Zhehao Hu, Yurong Qian, ZhengQing Xiao, Guangqi Yang, Hao Jiang, Xiao Sun
DOI: https://doi.org/10.1109/jstars.2024.3382096
IF: 4.715
2024-01-01
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Abstract: Land cover classification has attracted great interest as one of the most prominent applications of remote sensing images. The emergence of convolutional neural networks has greatly advanced land cover classification, but such networks ignore the positional relationships between pixels. When remotely sensed features exhibit both large intraclass scale differences and interclass similarities, the result is fuzzy class boundaries and misclassification of small samples, problems that existing methods struggle to solve. Inspired by the recent Transformer network, we propose a self-attentive bilateral network, SABNet, to alleviate these problems. Its backbone consists of a modified multiscale vision transformer and a stacked convolutional layer, which extract global spatial information and local contextual information, respectively. A local embedding module and a coordinate attention fusion module are further proposed in the feature fusion stage to reduce attention distraction and efficiently fuse high-level and low-level features. A stepwise feature fusion module is proposed in the decoder to fully fuse the features extracted from the two branches. Experiments show that, compared with existing methods with a similar number of parameters, our method achieves the best mIoU on both the Landcover.ai and GID-15 datasets: 91.49% on Landcover.ai and 64.23% on GID-15.
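The results above are reported in mIoU (mean intersection over union). As a rough illustration of what that metric measures, the following minimal sketch computes it from a confusion matrix for two small label maps. The function name `mean_iou`, the toy inputs, and the choice to skip classes absent from both maps are assumptions made for this example; the abstract does not describe the paper's actual evaluation code or protocol.

```python
def mean_iou(pred, target, num_classes):
    """Mean IoU over classes that appear in the prediction or the ground truth.

    pred and target are 2D lists of integer class labels with the same shape.
    This follows the common textbook definition, not a specific library's API.
    """
    # conf[t][p] counts pixels whose true class is t and predicted class is p.
    conf = [[0] * num_classes for _ in range(num_classes)]
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            conf[t][p] += 1

    ious = []
    for c in range(num_classes):
        tp = conf[c][c]
        fp = sum(conf[t][c] for t in range(num_classes)) - tp  # predicted c, truly other
        fn = sum(conf[c]) - tp                                  # truly c, predicted other
        union = tp + fp + fn
        if union > 0:  # assumption: classes absent from both maps are skipped
            ious.append(tp / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy 2x3 label maps with three classes (hypothetical data).
pred = [[0, 0, 1],
        [1, 1, 2]]
target = [[0, 1, 1],
          [1, 1, 2]]

print(mean_iou(pred, target, num_classes=3))
```

Here the per-class IoU values are 1/2, 3/4, and 1, so the mean is 0.75. In practice the confusion matrix would typically be accumulated over a whole test set before the per-class IoU values are averaged.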
What problem does this paper attempt to address?