Crack Detection in Infrastructure Using Transfer Learning, Spatial Attention, and Genetic Algorithm Optimization

Feng Ding
2024-11-26
Abstract: Crack detection plays a pivotal role in the maintenance and safety of infrastructure, including roads, bridges, and buildings, as timely identification of structural damage can prevent accidents and reduce costly repairs. Traditionally, manual inspection has been the norm, but it is labor-intensive, subjective, and hazardous. This paper introduces an advanced approach for crack detection in infrastructure using deep learning, leveraging transfer learning, spatial attention mechanisms, and genetic algorithm (GA) optimization. To address the limited availability of large datasets, we employ ResNet50 as a pre-trained model, utilizing its strong feature extraction capabilities while reducing the need for extensive training datasets. We enhance the model with a spatial attention layer and a customized neural network whose architecture was fine-tuned using a GA. A comprehensive case study demonstrates the effectiveness of the proposed Attention-ResNet50-GA model, which achieves a precision of 0.9967 and an F1 score of 0.9983, outperforming conventional methods. The results highlight the model's ability to accurately detect cracks in various conditions, making it highly suitable for real-world applications where large annotated datasets are scarce.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses the challenges of crack detection in infrastructure such as roads, bridges, and buildings. Traditional manual inspection is labor-intensive, highly subjective, and dangerous, and it cannot meet requirements for timeliness and accuracy. Automated image-based detection methods have potential, but they rely on simple machine-learning algorithms and perform poorly when materials, lighting, and surface conditions vary. To solve these problems, the author proposes a deep learning method that combines transfer learning, spatial attention mechanisms, and genetic algorithm optimization to improve the accuracy and efficiency of crack detection. The main objectives are:

1. **Reduce the dependence on a large amount of labeled data**: Using the pre-trained ResNet50 model for feature extraction reduces the need for large-scale labeled data.
2. **Enhance the model's focus on crack features**: A spatial attention mechanism enables the model to focus on the most relevant areas in the image, such as edges and irregular patterns, thereby improving the accuracy of crack detection.
3. **Optimize the neural network architecture**: A genetic algorithm optimizes the structure of the customized neural network to further improve model performance.

Through these techniques, the research aims to provide a more accurate, adaptable, and scalable solution suited to practical applications where large-scale labeled data is scarce.
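
The third objective, a genetic algorithm searching over the structure of the customized network, can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration rather than the paper's configuration: the search space (`num_layers`, `units`, `dropout`), the function names, the GA settings, and above all the `fitness` function, which is a toy surrogate. In a real pipeline, fitness would presumably train the candidate head on features from the frozen ResNet50 backbone with spatial attention and return a validation metric such as F1.

```python
import random

# Hypothetical search space for the custom head (not taken from the paper).
SEARCH_SPACE = {
    "num_layers": [1, 2, 3],
    "units": [32, 64, 128, 256],
    "dropout": [0.0, 0.2, 0.4],
}


def random_genome(rng):
    """Sample one candidate architecture uniformly from the search space."""
    return {key: rng.choice(options) for key, options in SEARCH_SPACE.items()}


def fitness(genome):
    """Toy stand-in for a validation score (higher is better).

    Assumption: peaks at 2 layers, 128 units, dropout 0.2. A real run would
    train and evaluate the network described by `genome` instead.
    """
    score = 1.0
    score -= 0.05 * abs(genome["num_layers"] - 2)
    score -= 0.0005 * abs(genome["units"] - 128)
    score -= 0.1 * abs(genome["dropout"] - 0.2)
    return score


def crossover(parent_a, parent_b, rng):
    """Uniform crossover: each gene is inherited from one parent at random."""
    return {key: rng.choice((parent_a[key], parent_b[key])) for key in SEARCH_SPACE}


def mutate(genome, rng, rate=0.2):
    """Resample each gene from its options with probability `rate`."""
    return {
        key: rng.choice(SEARCH_SPACE[key]) if rng.random() < rate else value
        for key, value in genome.items()
    }


def evolve(pop_size=12, generations=15, elite=2, seed=0):
    """Run a small GA with truncation selection and elitism."""
    rng = random.Random(seed)
    population = [random_genome(rng) for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        parents = population[: pop_size // 2]  # keep the fitter half as parents
        children = population[:elite]          # elites survive unchanged
        while len(children) < pop_size:
            parent_a, parent_b = rng.sample(parents, 2)
            children.append(mutate(crossover(parent_a, parent_b, rng), rng))
        population = children
    return max(population, key=fitness)


if __name__ == "__main__":
    best = evolve()
    print("best architecture:", best, "fitness:", round(fitness(best), 4))
```

Because the elites are carried over unchanged, the best fitness never decreases between generations. That property matters when each fitness evaluation is an expensive training run.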