A Road Crack Segmentation Method Based on Transformer and Multi-Scale Feature Fusion

Yang Xu,Yonghua Xia,Quai Zhao,Kaihua Yang,Qiang Li
DOI: https://doi.org/10.3390/electronics13122257
IF: 2.9
2024-06-09
Electronics
Abstract:To ensure the safety of vehicle travel, the maintenance of road infrastructure has become increasingly critical, with efficient and accurate detection techniques for road cracks emerging as a key research focus in the industry. The development of deep learning technologies has shown tremendous potential in improving the efficiency of road crack detection. While convolutional neural networks have proven effective in most semantic segmentation tasks, overcoming their limitations in road crack segmentation remains a challenge. To address this, this paper proposes a novel road crack segmentation network that leverages the powerful spatial feature modeling capabilities of Swin Transformer and the Encoder–Decoder architecture of DeepLabv3+. Additionally, the incorporation of a multi-scale coding module and attention mechanism enhances the network's ability to densely fuse multi-scale features and expand the receptive field, thereby improving the integration of information from feature maps. Performance comparisons with current mainstream semantic segmentation models on crack datasets demonstrate that the proposed model achieves the best results, with an MIoU of 81.06%, Precision of 79.95%, and F1-score of 77.56%. The experimental results further highlight the model's superior ability in identifying complex and irregular cracks and extracting contours, providing guidance for future applications in this field.
engineering, electrical & electronic,computer science, information systems,physics, applied
What problem does this paper attempt to address?