Synthetic data augmentation for high-resolution X-ray welding defect detection and classification based on a small number of real samples
Liangliang Li,Peng Wang,Jia Ren,Zhigang Lü,Xiaoyan Li,Hui Gao,RuoHai Di
DOI: https://doi.org/10.1016/j.engappai.2024.108379
IF: 8
2024-04-13
Engineering Applications of Artificial Intelligence
Abstract:Deep learning has become the dominant technology in most computer vision tasks. These methods often rely on a large number of labeled sample datasets for training, and in the field of non-destructive testing of welds in industrial manufacturing, weld images with defects are very scarce, and it is still a challenging challenge to construct high-resolution weld defect datasets that meet the requirements. To overcome this limitation, a new data augmentation method for high-resolution X-ray welding defect classification and synthesis based on a small number of real samples is proposed to realize the data augmentation of industrial nondestructive inspection X-ray film defect images. Firstly, to overcome the scarcity of the weld X-ray defect classification dataset, the weld X-ray defect classification dataset (Weld Defect Classification, WDC) is constructed. Secondly, the performance of 16 common deep classification models on WDC datasets is explored. Then, the images of the real local welding defects and the non-defective weld area are fused at random locations, and two data augmentation modes, (Single Image Single Defect, SISD) and (Single Image Multi Defects, SIMD), can generate defect files and annotation files (Visual Object Classes, VOC) at the same time, which can save a lot of time for manual marking. Finally, compared with the traditional data augmentation method, the proposed method can effectively improve the accuracy of defect detection and generalization, the mAP (Mean Average Precision, mAP) @0.5 of YOLOV8X (You Only Look Once, YOLO) and YOLOV5.6.1X is 66.6% and 72.8%, which provides an effective solution for data sample generation in the industrial field.
automation & control systems,computer science, artificial intelligence,engineering, electrical & electronic, multidisciplinary