Steel surface defect detection based on sparse global attention transformer

Yinghao Li,Zhiyong Han,Wenmeng Wang,Heping Xu,Yongpeng Wei,Guangjun Zai
DOI: https://doi.org/10.1007/s10044-024-01375-9
IF: 2.307
2024-12-04
Pattern Analysis and Applications
Abstract:The detection of surface defects in steel is a fundamental technique for verifying the quality of the material. Despite the widespread use of transformer-based detection methods in defect detection, the precision and speed are still far from satisfactory according to industry standards. A detection model based on sparse global attention is presented in this paper. We proposed a simple sparse sliding-window attention, which localizes self attention for each pixel to its near neighbors. By adjusting dilation values, a larger receptive field can be obtained to improve the detection effect of large-size defects. Then we use the Content-Aware ReAssembly of FEatures (CARAFE) feature upsampling operator, which can aggregate contextual information in the large receptive field and generate features in the predefined region in the way of content-aware to improve the effect of feature fusion. Finally, the EIOU loss is introduced to solve the problem of scale consistency of the bounding box. Through ablation experiments, we analyze the effect of different dilation values on object detection performance. The proposed algorithm achieves the Mean Average Precision (mAP) of 83.7% on the NEU-DET dataset. Through experimentation with the aluminum defect dataset, we have demonstrated that our approach is applicable to other types of defects.
computer science, artificial intelligence
What problem does this paper attempt to address?