Abstract:Industrial surface defect detection is an important part of industrial production, which aims to identify and detecting various defects on the surface of product to ensure quality and meet customer requirements. With the development of deep learning and image processing technologies, the surface defect detection methods based on computer vision has become the mainstream method. However, the prevalent convolutional neural network(CNN)-based de-fect detection methods also have many problems. For example, these methods rely on post-processing of Non-Maximum Suppression (NMS) and have poor detection ability for small targets, which affects the speed and accuracy of surface defect detection in industrial scenar-ios. Therefore, we propose a novel DEtection TRansformer (DETR)-based surface defect detection method. Firstly, we propose a Multi-scale Contextual Information Dilated module and fuse it into the backbone. The module is mainly composed of large kernel convolutions, which aims to expand the receptive field of the model, thus reducing the leakage rate of the model. Moreover, we design an efficient encoder which mainly contains two important mod-ules, namely feature enhancement based on cascaded group attention module and efficient feature fusion module based on content-aware. The former module effectively enhances the high-level semantic information extracted by the backbone, thus enabling the model to better interpret features, and it can improve the problem of high computational cost of transformer encoder, thus increasing the detection speed. The latter module performs multi-scale feature fusion across the feature information of various scales, thus improving the detection accura-cy of the model for small-size defects. Experimental results show that the proposed method achieves 80.6%mAP and 80.3FPS on NEU-DET, and 98.0%mAP and 79.4FPS on PCB-DET. Our proposed method exhibits excellent detection performance and achieves real-time and efficient surface defect detection capability to meet the needs of industrial surface defect detection.

Cross Resolution Encoding-Decoding For Detection Transformers

A Transformer-Based Object Detector with Coarse-Fine Crossing Representations

DETR++: Taming Your Multi-Scale Detection Transformer

Towards Data-Efficient Detection Transformers

AugDETR: Improving Multi-scale Learning for Detection Transformer

Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR

Semantic-Aligned Matching for Enhanced DETR Convergence and Multi-Scale Feature Fusion

D^2ETR: Decoder-Only DETR with Computationally Efficient Cross-Scale Attention

Sparse DETR: Efficient End-to-End Object Detection with Learnable Sparsity

Deformable DETR: Deformable Transformers for End-to-End Object Detection

DETR-ORD: An Improved DETR Detector for Oriented Remote Sensing Object Detection with Feature Reconstruction and Dynamic Query

Efficient DETR: Improving End-to-End Object Detector with Dense Prior

AParC-DETR: Accelerate DETR training by introducing Adaptive Position-aware Circular Convolution

Conditional DETR for Fast Training Convergence.

Investigating the Robustness and Properties of Detection Transformers (DETR) Toward Difficult Images

Recurrent Glimpse-based Decoder for Detection with Transformer

L-DETR: A Light-Weight Detector for End-to-End Object Detection With Transformers

DETRs Beat YOLOs on Real-time Object Detection

REDef-DETR: Real-time and Efficient DETR for industrial surface defect detection

Focus-Attention Approach in Optimizing DETR for Object Detection from High-Resolution Images

PR-Deformable DETR: DETR for Remote Sensing Object Detection