Road extraction from remote sensing images based on a multi-scale asymmetric dual attention mechanism

Shenming Qu Suchen Liu Fengyu Han Yuan Xie Henan International Joint Laboratory of Theories and Key Technologies on Intelligence Networks,Henan University,Software College,Kaifeng,China
DOI: https://doi.org/10.1080/2150704x.2024.2370498
IF: 2.369
2024-07-25
Remote Sensing Letters
Abstract:Aiming at the problems of road fracture and detail loss caused by not considering the geometric features in the road extraction method. We proposed an encoder-decoder architecture based on a multi-scale asymmetric dual attention mechanism. Firstly, A multi-scale convolution block in the shape of 'Union Jack' is designed. It includes symmetric convolution and asymmetric convolution along horizontal, vertical, left diagonal, and right diagonal spatial directions, and a multi-scale dilated convolution for extracting features of different scales. Remote dependence relationships are highly converged by using it, and road fracture problems caused by occlusion can be solved effectively. Secondly, a directional dual attention mechanism is proposed, which consists of directional channel attention using strip pooling and a directional spatial attention mechanism using asymmetric convolution along left diagonal and right diagonal spatial directions. It can use the directivity of asymmetric convolution to allocate attention mechanism adaptively in attention mechanism, and effectively avoid the road detail loss problem. Finally, we conducted corresponding experiments on the DeepGlobe and Ottawa road datasets, and the experimental results are superior to the current state-of-the-art methods.
imaging science & photographic technology,remote sensing
What problem does this paper attempt to address?