Deep Convolutional Neural Networks for Semantic Segmentation of Cracks

Jia-Ji Wang, Yu-Fei Liu, Xin Nie, Y. L. Mo
DOI: https://doi.org/10.1002/stc.2850
2021-01-01
Structural Control and Health Monitoring
Abstract: A large crack detection dataset of 2446 manually labeled images is established to cover a wide range of noise and to evaluate the performance of end-to-end deep convolutional networks in detecting cracks. Five state-of-the-art end-to-end deep computer vision architectures for semantic segmentation are trained and evaluated: Fully Convolutional Network (FCN), Global Convolutional Network (GCN), Pyramid Scene Parsing Network (PSPNet), UPerNet, and DeepLabv3+. VGG, ResNet, and DenseNet are adopted as backbones. On the test set, DeepLabv3+ with the ResNet101 backbone achieved the highest IoU (0.6298), the highest recall (0.6834), and the highest F1 score (0.7732). The influence of database choice and image noise on crack detection performance is reported. Based on a comparison of predicted images, UPerNet with the ResNet101 backbone performs best on images with shading, while DeepLabv3+ with the ResNet101 backbone performs best on images with blemishes. The research outcome can serve as a reference for fast and accurate crack detection in civil engineering.
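For readers unfamiliar with the metrics quoted in the abstract, the following is a minimal sketch of how IoU, recall, and F1 can be computed pixel-wise for one binary crack mask. It is an illustration only, not the authors' evaluation code: the function name `segmentation_metrics`, the 0/1 list-of-lists mask format, and the convention of returning 1.0 when a denominator is zero are all assumptions, and the abstract does not say how scores were averaged across images.

```python
def segmentation_metrics(pred, truth):
    """Pixel-wise IoU, precision, recall, and F1 for two binary masks.

    pred and truth are equally sized 2D lists of 0/1 values,
    where 1 marks a crack pixel (hypothetical input format).
    """
    tp = fp = fn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1      # crack pixel correctly predicted
            elif p and not t:
                fp += 1      # background predicted as crack
            elif t and not p:
                fn += 1      # crack pixel missed

    def ratio(num, den):
        # Assumed convention: an empty denominator counts as a perfect score.
        return num / den if den else 1.0

    return {
        "iou": ratio(tp, tp + fp + fn),
        "precision": ratio(tp, tp + fp),
        "recall": ratio(tp, tp + fn),
        "f1": ratio(2 * tp, 2 * tp + fp + fn),
    }


# Toy 2x4 masks: two true positives, one false positive, one false negative.
truth = [[0, 1, 1, 0],
         [0, 1, 0, 0]]
pred = [[0, 1, 0, 0],
        [1, 1, 0, 0]]
metrics = segmentation_metrics(pred, truth)
```

For a single mask, IoU and F1 are related by IoU = F1 / (2 - F1), so F1 is never lower than IoU. The reported 0.7732 and 0.6298 follow this relation only approximately, which suggests some form of per-image averaging, though the abstract does not confirm this.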