A new method based on YOLOv5 and multiscale data augmentation for visual inspection in substation

Junjie Chen,Siqi Pan,Yanping Chan,Yuedong Ni,Donghua Ye
DOI: https://doi.org/10.1038/s41598-024-60126-2
IF: 4.6
2024-04-24
Scientific Reports
Abstract:Artificial intelligence has demonstrated notable advancements in the realm of visual inspection and defect detection in substations. Nevertheless, practical application presents challenges, with issues arising from the dynamic shooting environment and limited dataset resulting in suboptimal defect identification accuracy and instability. To address these concerns, a pioneering approach based on hybrid pruning YOLOv5 and multiscale data augmentation is proposed for enhancing defect detection in substations. Initially, an enhanced multiscale data augmentation method is proposed. The improved multiscale data augmentation mitigates the impact of the time-varying shooting environment on recognition accuracy and enhances defect detection precision. Subsequently, YOLOv5 is employed for training and detecting defects within multi-scale image data. To alleviate the potential destabilizing effects of YOLOv5's large-scale parameters on model stability, a new model pruning method is implemented. This method strategically prunes parameters to bolster the model's defect identification accuracy. The efficacy of the proposed methodology is evaluated through testing on substation defect images, confirming its effectiveness in enhancing defect detection capabilities.
multidisciplinary sciences
What problem does this paper attempt to address?
### Problems Addressed by the Paper This paper aims to address the issue of defect identification in substation visual inspection. Specifically, the paper proposes a new method based on Pruning YOLOv5 (Hybrid Pruning YOLOv5) and Multiscale Data Augmentation (HPYMDA) to improve the accuracy and stability of defect detection in substation equipment. #### Existing Problems: 1. **Impact of Dynamic Shooting Environment**: The operation of shooting tools leads to changes in the shooting environment, affecting the image quality and thus the accuracy of defect identification. 2. **Limited Dataset**: Due to the infrequent occurrence of defects in on-site equipment, the amount of defect fault image data that can be captured is relatively small, resulting in insufficient model training and reduced accuracy in defect identification. #### Proposed Methods: 1. **Multiscale Data Augmentation**: An improved multiscale data augmentation method is introduced. By designing convolution kernels with different distribution initialization parameters, diverse multiscale features are extracted, increasing the richness of input data. 2. **Model Pruning**: A new model pruning method is proposed. Pruning is performed based on the weight distribution of feature maps, removing channels with marginal weight distribution and retaining channels with significant feature extraction effects, thereby improving the efficiency of model pruning. ### Main Contributions: 1. A new method based on Pruning YOLOv5 and Multiscale Data Augmentation is proposed, suitable for substation patrol inspection tasks. 2. A new multiscale data augmentation method is introduced, generating convolution kernel initialization weight parameters through various distribution methods to enhance feature diversity. 3. A new pruning method is proposed, determining pruning positions based on the weight distribution of feature maps, improving the accuracy of pruning and model efficiency. In summary, this paper addresses the issues in substation defect detection caused by dynamic shooting environments and limited datasets by proposing the HPYMDA method, thereby improving the accuracy and stability of defect identification.