Dense Multiscale Feature Learning Transformer Embedding Cross-Shaped Attention for Road Damage Detection

Chuan Xu,Qi Zhang,Liye Mei,Sen Shen,Zhaoyi Ye,Di Li,Wei Yang,Xiangyang Zhou
DOI: https://doi.org/10.3390/electronics12040898
IF: 2.9
2023-02-10
Electronics
Abstract:Road damage detection is essential to the maintenance and management of roads. The morphological road damage contains a large number of multi-scale features, which means that existing road damage detection algorithms are unable to effectively distinguish and fuse multiple features. In this paper, we propose a dense multiscale feature learning Transformer embedding cross-shaped attention for road damage detection (DMTC) network, which can segment the damage information in road images and improve the effectiveness of road damage detection. Our DMTC makes three contributions. Firstly, we adopt a cross-shaped attention mechanism to expand the perceptual field of feature extraction, and its global attention effectively improves the feature description of the network. Secondly, we use the dense multi-scale feature learning module to integrate local information at different scales, so that we are able to overcome the difficulty of detecting multiscale targets. Finally, we utilize a multi-layer convolutional segmentation head to generalize the previous feature learning and get a final detection result. Experimental results show that our DMTC network could segment pavement pothole patterns more accurately and effectively than other methods, achieving an F1 score of 79.39% as well as an OA score of 99.83% on the cracks-and-potholes-in-road-images-dataset (CPRID).
engineering, electrical & electronic,computer science, information systems,physics, applied
What problem does this paper attempt to address?