Enhanced YOLO v3 for precise detection of apparent damage on bridges amidst complex backgrounds

Huifeng Su,David Bonfils Kamanda,Tao Han,Cheng Guo,Rongzhao Li,Zhilei Liu,Fengzhao Su,Liuhong Shang
DOI: https://doi.org/10.1038/s41598-024-58707-2
IF: 4.6
2024-04-17
Scientific Reports
Abstract:A bridge disease identification approach based on an enhanced YOLO v3 algorithm is suggested to increase the accuracy of apparent disease detection of concrete bridges under complex backgrounds. First, the YOLO v3 network structure is enhanced to better accommodate the dense distribution and large variation of disease scale characteristics, and the detection layer incorporates the squeeze and excitation (SE) networks attention mechanism module and spatial pyramid pooling module to strengthen the semantic feature extraction ability. Secondly, CIoU with better localization ability is selected as the loss function for training. Finally, the K-means algorithm is used for anchor frame clustering on the bridge surface disease defects dataset. 1363 datasets containing exposed reinforcement, spalling, and water erosion damage of bridges are produced, and network training is done after manual labelling and data improvement in order to test the efficacy of the algorithm described in this paper. According to the trial results, the YOLO v3 model has enhanced more than the original model in terms of precision rate, recall rate, Average Precision (AP), and other indicators. Its overall mean Average Precision (mAP) value has also grown by 5.5%. With the RTX2080Ti graphics card, the detection frame rate increases to 84 Frames Per Second, enabling more precise and real-time bridge illness detection.
multidisciplinary sciences
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to improve the detection accuracy in bridge surface defect detection under complex backgrounds. Specifically, the paper makes improvements on the following problems existing in the existing YOLO v3 algorithm in bridge surface defect detection: 1. **Dense distribution of defect scales and large scale changes**: Defects on the bridge surface are often densely distributed and have large scale changes, which pose challenges to defect detection. 2. **Insufficient feature extraction ability**: Although the YOLO v3 algorithm uses a multi - scale prediction method, the extracted features have low robustness and are not suitable for bridge defect detection under complex backgrounds. 3. **Poor positioning effect**: The traditional Intersection over Union (IoU) loss function cannot provide effective adjustment gradients under complex backgrounds, resulting in a decrease in defect positioning accuracy. To address these problems, the paper proposes an enhanced YOLO v3 algorithm. The main improvement points include: 1. **Spatial Pyramid Pooling module (SPP)**: Introduce the SPP module to better handle defect features of different scales and improve the richness of feature representation. 2. **Feature extraction network with SENet embedded**: Embed the attention mechanism of SENet in front of the three detection layers of YOLO v3, and enhance the feature expression ability by recalibrating channel weights. 3. **Use a better positioning loss function (CIoU)**: Introduce the CIoU loss function, which takes into account the distance between the center points of the prediction box and the ground - truth box, the overlap ratio, and the aspect ratio, making the bounding box regression more stable. 4. **Generate anchor boxes by K - means clustering**: Perform K - means clustering on the self - built bridge defect data set to generate more suitable anchor boxes to adapt to defects of different scales. Through these improvements, the paper aims to improve the accuracy and real - time performance of bridge surface defect detection, especially the detection effect under complex backgrounds. Experimental results show that the enhanced YOLO v3 algorithm is superior to the original YOLO v3 algorithm in terms of precision rate, recall rate, and average precision (AP). The overall mean average precision (mAP) value has increased by 5.5%, and the detection frame rate has reached 84 frames per second.