Abstract: Roads extracted from high-resolution remote sensing images are widely used in fields such as autonomous driving, road planning, and disaster relief. However, the extracted roads often lack connectivity and completeness because of occlusion by surrounding ground objects, interference from similar-looking targets, and the slender structure of roads themselves. To address this issue, we propose a novel dual-path convolutional neural network with a strip dilated attention module, named DPSDA-Net, which adopts a U-shaped encoder–decoder structure and combines the advantages of attention mechanisms, dilated convolution, and strip convolution. The encoder uses ResNet50 as its backbone. A strip position attention mechanism is added between the residual blocks to strengthen the coherent semantic information of roads, and a long-distance shortcut connection is introduced to preserve the spatial information of the original image during downsampling. In addition, a pyramid dilated module with strip convolution and an attention mechanism is constructed between the encoder and decoder to strengthen multi-scale extraction of road features, expand the model's receptive field, and focus on global spatial semantics and connectivity. To verify the reliability of the proposed model, road extraction experiments were carried out on the Massachusetts dataset and the LRSNY dataset. The results show that, compared with other typical road extraction methods, the proposed model achieves a higher F1 score and IoU. DPSDA-Net comprehensively characterizes the structural features of roads, extracts roads more accurately, retains road details, and improves the connectivity and integrity of road extraction in remote sensing images.
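The abstract reports results as F1 score and IoU. As a minimal sketch of how these two metrics are commonly computed for a binary road mask, the following pure-Python example counts pixel-level true positives, false positives, and false negatives. The function name `f1_and_iou` and the toy 4x4 masks are illustrative assumptions, not taken from the paper, and the paper's exact evaluation protocol may differ.

```python
def f1_and_iou(pred, truth):
    """Return (F1, IoU) for two equally sized binary masks given as nested lists.

    Hypothetical helper for illustration; 1 marks a road pixel, 0 background.
    """
    tp = fp = fn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1          # road pixel correctly predicted
            elif p and not t:
                fp += 1          # background predicted as road
            elif t and not p:
                fn += 1          # road pixel missed
    denom = tp + fp + fn
    if denom == 0:
        # Both masks are empty: treat as a perfect match (an assumption).
        return 1.0, 1.0
    f1 = 2 * tp / (2 * tp + fp + fn)   # harmonic mean of precision and recall
    iou = tp / denom                   # intersection over union of road pixels
    return f1, iou


# Toy ground truth: an L-shaped road of five pixels.
truth = [
    [0, 1, 0, 0],
    [0, 1, 0, 0],
    [0, 1, 1, 1],
    [0, 0, 0, 0],
]
# Toy prediction: misses two road pixels (a connectivity gap) and adds one false pixel.
pred = [
    [0, 1, 0, 0],
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [1, 0, 0, 0],
]

f1, iou = f1_and_iou(pred, truth)
print(f"F1 = {f1:.3f}, IoU = {iou:.3f}")
```

In this toy case there are 3 true positives, 1 false positive, and 2 false negatives, so F1 = 6/9 and IoU = 3/6. Note that a broken road segment lowers both scores only slightly, which is why pixel-level metrics alone do not fully capture the connectivity that the abstract emphasizes.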

Road Extraction by Multiscale Deformable Transformer from Remote Sensing Images

BDTNet: Road Extraction by Bi-Direction Transformer From Remote Sensing Images

DDCTNet: A Deformable and Dynamic Cross-Transformer Network for Road Extraction From High-Resolution Remote Sensing Images

Road Extraction From Remote Sensing Images via Channel Attention and Multilayer Axial Transformer

DCTNET: Hybrid Network Model Fusing With Multiscale Deformable CNN and Transformer Structure for Road Extraction From Gaofen Satellite Remote Sensing Image

Multiscale Global Attention Network With Edge Perceptron for Automatic Road Extraction From Remote Sensing Imagery

MCMCNet: A Semi-supervised Road Extraction Network for High-resolution Remote Sensing Images Via Multiple Consistency and Multi-task Constraints

Lightweight remote sensing road detection with an attention-augmented transformer

Road Extraction Convolutional Neural Network with Embedded Attention Mechanism for Remote Sensing Imagery

UMiT-Net: A U-Shaped Mix-Transformer Network for Extracting Precise Roads Using Remote Sensing Images

DPSDA-Net: Dual-Path Convolutional Neural Network with Strip Dilated Attention Module for Road Extraction from High-Resolution Remote Sensing Images

Seg-Road: A Segmentation Network for Road Extraction Based on Transformer and CNN with Connectivity Structures

Dense Multiscale Feature Learning Transformer Embedding Cross-Shaped Attention for Road Damage Detection

DEGANet: Road Extraction Using Dual-Branch Encoder With Gated Attention Mechanism

MECA-Net: A MultiScale Feature Encoding and Long-Range Context-Aware Network for Road Extraction from Remote Sensing Images

C2S-RoadNet: Road Extraction Model with Depth-Wise Separable Convolution and Self-Attention

RoadCT: A Hybrid CNN-Transformer Network for Road Extraction From Satellite Imagery

Road Extraction from High-Resolution Remote Sensing Images via Local and Global Context Reasoning

A Lightweight High-Resolution RS Image Road Extraction Method Combining Multi-Scale and Attention Mechanism

URoadNet: Dual Sparse Attentive U-Net for Multiscale Road Network Extraction