Road Crack Detection Using Deep Convolutional Neural Network and Adaptive Thresholding

Rui Fan, Mohammud Junaid Bocus, Yilong Zhu, Jianhao Jiao, Li Wang, Fulong Ma, Shanshan Cheng, Ming Liu
DOI: https://doi.org/10.48550/arXiv.1904.08582
2019-04-18
Abstract: Cracks are among the most common road distresses and may pose road safety hazards. Generally, crack detection is performed by either certified inspectors or structural engineers. This task is, however, time-consuming, subjective and labor-intensive. In this paper, we propose a novel road crack detection algorithm based on deep learning and adaptive image segmentation. Firstly, a deep convolutional neural network is trained to determine whether an image contains cracks or not. The images containing cracks are then smoothed using bilateral filtering, which greatly reduces the number of noisy pixels. Finally, we utilize an adaptive thresholding method to extract the cracks from the road surface. The experimental results illustrate that our network can classify images with an accuracy of 99.92%, and the cracks can be successfully extracted from the images using our proposed thresholding algorithm.
Computer Vision and Pattern Recognition, Machine Learning, Image and Video Processing
What problem does this paper attempt to address?
The problem that this paper attempts to solve is road crack detection. Traditional crack detection usually relies on certified inspectors or structural engineers, which makes it time-consuming, subjective and labor-intensive. The paper therefore proposes a road crack detection algorithm based on deep learning and adaptive image segmentation, aiming to improve the efficiency and accuracy of crack detection.

### Main contributions of the paper:

1. **Image classification**: A deep convolutional neural network (DCNN) determines whether an image contains cracks. The network structure is shown in Figure 1. It classifies images using convolutional layers, batch normalization, ReLU activation functions, max pooling, and fully-connected layers.
2. **Image segmentation**: Images classified as containing cracks are smoothed with bilateral filtering to reduce the influence of noisy pixels. The crack regions are then extracted from the images by the adaptive thresholding method.

### Experimental results:

- **Image classification**: The proposed network achieves an accuracy of 99.92% in image classification.
- **Image segmentation**: In the pixel-level segmentation evaluation, the proposed method outperforms the Otsu thresholding method in precision, accuracy, and F1-measure. The segmentation results are best when τ is set to 1.

### Formula display:

1. **Bilateral filtering**:

   \[ i_{bf}(u,v)=\frac{\sum_{x = u-\rho}^{u + \rho}\sum_{y = v-\rho}^{v + \rho}\omega_s(x,y)\,\omega_c(x,y)\,i(x,y)}{\sum_{x = u-\rho}^{u + \rho}\sum_{y = v-\rho}^{v + \rho}\omega_s(x,y)\,\omega_c(x,y)} \]

   where the spatial weight \(\omega_s\) and the intensity weight \(\omega_c\) are

   \[ \omega_s(x,y)=\exp\left(-\frac{(x - u)^2+(y - v)^2}{\sigma_s^2}\right) \]

   \[ \omega_c(x,y)=\exp\left(-\frac{(i(x,y)-i(u,v))^2}{\sigma_c^2}\right) \]

   The exponents are negative, so pixels that are far from \((u,v)\) or differ strongly in intensity from \(i(u,v)\) receive small weights.

2. **Adaptive thresholding method**:
   - Assume that the down-sampled image consists of two parts, the foreground (cracks) and the background (road surface), which can be separated by a threshold δ.
   - Use the k-means clustering algorithm to find the optimal threshold δ by minimizing the within-cluster sum of squares:

   \[ \arg\min_S\sum_{i = 1}^2\sum_{m\in S_i}\|m-\mu_i\|^2 \]

   where \(S=\{S_1,S_2\}\) is the partition into two clusters and \(\mu_i\) is the mean of the vectors in cluster \(S_i\).

### Conclusions and future work:

This paper proposes a road crack detection method that combines a DCNN for image classification with an image segmentation algorithm based on k-means clustering. The experimental results show that the method performs well in both image classification and pixel-level segmentation. Future work can explore the use of deep neural networks for image segmentation to improve crack detection in complex backgrounds.
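To make the bilateral filtering formula above concrete, here is a minimal pure-Python sketch. It is not the authors' implementation: the function name `bilateral_filter`, the list-of-rows image representation, and the decision to truncate the window at image borders are assumptions made for this example. The parameters `rho`, `sigma_s` and `sigma_c` correspond to ρ, σ_s and σ_c. The weights use a negative exponent, as in the standard bilateral filter, so that distant or dissimilar pixels contribute less.

```python
import math


def bilateral_filter(img, rho, sigma_s, sigma_c):
    """Smooth a 2D grayscale image given as a list of rows of numbers.

    Each output pixel is a weighted mean over a (2*rho+1) x (2*rho+1)
    window. Assumption: the window is clipped at the image borders.
    """
    height, width = len(img), len(img[0])
    out = [[0.0] * width for _ in range(height)]
    for u in range(height):
        for v in range(width):
            num = 0.0
            den = 0.0
            for x in range(max(0, u - rho), min(height, u + rho + 1)):
                for y in range(max(0, v - rho), min(width, v + rho + 1)):
                    # Spatial weight: decays with distance from (u, v).
                    w_s = math.exp(-((x - u) ** 2 + (y - v) ** 2) / sigma_s ** 2)
                    # Intensity weight: decays with intensity difference.
                    w_c = math.exp(-((img[x][y] - img[u][v]) ** 2) / sigma_c ** 2)
                    num += w_s * w_c * img[x][y]
                    den += w_s * w_c
            # den >= 1 because the centre pixel always has weight 1.
            out[u][v] = num / den
    return out


# Hypothetical usage: one bright outlier on a flat patch is pulled
# towards its neighbours.
noisy = [[100.0, 100.0, 100.0],
         [100.0, 110.0, 100.0],
         [100.0, 100.0, 100.0]]
smoothed = bilateral_filter(noisy, rho=1, sigma_s=2.0, sigma_c=20.0)
```

With a small `sigma_c`, pixels across a strong edge get near-zero weight, which is why the filter can suppress noise without blurring the boundary between crack and road surface.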
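The k-means objective used for adaptive thresholding can be sketched in the same way. The following is an illustration under stated assumptions, not the paper's code: it clusters scalar pixel intensities with two centroids (Lloyd's algorithm), takes δ as the midpoint between the two cluster means, and treats pixels darker than δ as cracks. The midpoint rule, the dark-crack convention, and the names `kmeans_threshold` and `segment` are all assumptions introduced here.

```python
def kmeans_threshold(pixels, max_iter=100):
    """Return a threshold separating 1D intensities into two clusters.

    Runs two-cluster k-means, which lowers the within-cluster sum of
    squares at every step. Assumption: the threshold is the midpoint
    of the two cluster means.
    """
    lo, hi = min(pixels), max(pixels)
    if lo == hi:
        return float(lo)  # uniform image: nothing to separate
    mu = [float(lo), float(hi)]  # initial centroids at the extremes
    for _ in range(max_iter):
        clusters = [[], []]
        for p in pixels:
            # Assign each pixel to the nearest centroid.
            k = 0 if abs(p - mu[0]) <= abs(p - mu[1]) else 1
            clusters[k].append(p)
        new_mu = [sum(c) / len(c) for c in clusters]
        if new_mu == mu:  # converged: assignments no longer change
            break
        mu = new_mu
    return (mu[0] + mu[1]) / 2.0


def segment(img):
    """Return (mask, delta); mask is 1 where a pixel is darker than delta."""
    pixels = [p for row in img for p in row]
    delta = kmeans_threshold(pixels)
    mask = [[1 if p < delta else 0 for p in row] for row in img]
    return mask, delta


# Hypothetical usage: a dark diagonal "crack" on a bright surface.
road = [[200, 198, 40],
        [201, 35, 199],
        [38, 202, 200]]
mask, delta = segment(road)
```

Because the data are one-dimensional and the centroids start at the minimum and maximum intensity, neither cluster can become empty, so the mean computation is always well defined.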