Abstract:Accurate identification of cracks at the pixel level on intricate asphalt pavements represents a crucial challenge in the domain of intelligent pavement assessment. The current advanced deep-learning networks encounter limitations in simultaneously capturing both the global context and local features of cracks, leading to discontinuous segmentation results and suboptimal recovery of local details. This paper proposes a robust architecture named Mix-Graph CrackNet to present an efficacious solution for this challenge. The Mix-Graph CrackNet, as proposed, is designed to mix the global context and local features multiple times, allowing for a comprehensively understanding of the essential features. Specifically, this paper develops the learnable parallel convolutional-Transformer mixing module to parallelly capture the sophisticated local features as well as the crucial global context. In addition, a new fusion unit is devised in the paper and deployed in the learnable parallel convolutional-Transformer mixing module. The proposed fusion unit is capable of effectively mixing contextual features extracted at both global and local scales while retaining an abundant level of textural details germane to the crack. Moreover, this paper constructs a graph-based skip connection that functions as a shortcut connecting the encoder and decoder, with the primary objective of mitigating information decay. The experimental results are remarkable, with the Mix-Graph CrackNet achieving F-measure and Intersection-Over-Union of 90.37% and 82.43%, respectively, on 1000 testing images. Based on the performance evaluations conducted on both public and private datasets, the proposed Mix-Graph CrackNet architecture demonstrates a significantly superior detection accuracy in comparison to several state-of-the-art models for semantic segmentation.

A Road Crack Segmentation Method Based on Transformer and Multi-Scale Feature Fusion

A novel transformer-based network with attention mechanism for automatic pavement crack detection

Multi-scale feature fusion for pavement crack detection based on Transformer

Image-based Concrete Crack Detection in Tunnels Using Deep Fully Convolutional Networks

Detection of Road Crack Images Based on Multistage Feature Fusion and a Texture Awareness Method

Adaptive Canny and Semantic Segmentation Networks Based on Feature Fusion for Road Crack Detection

Dual-path network combining CNN and transformer for pavement crack segmentation

FCT-Net: A dual-encoding-path network fusing atrous spatial pyramid pooling and transformer for pavement crack detection

UTE-CrackNet: transformer-guided and edge feature extraction U-shaped road crack image segmentation

Pavement Crack Detection and Segmentation Method Based on Improved Deep Learning Fusion Model

A novel real-time pixel-level road crack segmentation network

CrackT-net: a Method of Convolutional Neural Network and Transformer for Crack Segmentation

Roadway Crack Segmentation Based on an Encoder-decoder Deep Network with Multi-scale Convolutional Blocks

Automatic crack detection on concrete and asphalt surfaces using semantic segmentation network with hierarchical Transformer

CrackNet: A Hybrid Model for Crack Segmentation with Dynamic Loss Function

A lightweight feature attention fusion network for pavement crack segmentation

Robust Semantic Segmentation for Automatic Crack Detection Within Pavement Images Using Multi-Mixing of Global Context and Local Image Features

An average pooling designed Transformer for robust crack segmentation

A crack detection network with multi-channel attention and enhanced information interaction

Automatic concrete infrastructure crack semantic segmentation using deep learning

Structural Damage Semantic Segmentation Using Dual-Network Fusion