Abstract: Strip steel is widely used in industries such as automotive manufacturing and aerospace because of its machinability, low cost, and adaptability. However, surface defects on steel strips, such as inclusions, patches, and scratches, significantly degrade the performance and service life of the product. Salient object detection of strip steel surface defects is therefore crucial to ensuring the quality of the final product. Several factors, including the low contrast of surface defects, the diversity of defect types, complex texture structures, and irregular defect distribution, prevent existing detection methods from accurately identifying and segmenting defect areas against complex backgrounds. To address these problems, we propose a novel detector, S3D-SOD, for the salient object detection of strip steel surface defects. In the encoding stage, a residual self-attention block is proposed to exploit the semantic cues in high-level features, which are then used to locate defects and guide the low-level features. In addition, we apply general residual channel and spatial attention to the low-level features, enabling the model to adaptively focus on the key channels and spatial regions of high-resolution feature maps, thereby enhancing the encoder features and accelerating model convergence. In the decoding stage, a simple residual decoder block with an upsampling operation is proposed to integrate feature information from different layers and let it interact. A simple residual decoder block suffices for feature integration because of the following observation: backbone networks such as ResNet and the Swin Transformer, after being pretrained on the large-scale ImageNet dataset and then fine-tuned on a smaller strip steel surface defect dataset, can extract feature maps that contain both general image features and the specific characteristics required for the salient object detection of strip steel surface defects. Experimental results on the SD-saliency-900 dataset show that S3D-SOD outperforms advanced existing methods and has strong generalization ability and robustness.
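To make the idea of residual channel and spatial attention on a low-level feature map more concrete, the following NumPy sketch may help. It is not the S3D-SOD implementation: the squeeze-and-excitation style channel gate, the parameter-free spatial gate (standing in for a learned convolution), the reduction ratio `r`, and all function and weight names are assumptions chosen purely for illustration.

```python
import numpy as np


def sigmoid(x):
    """Element-wise logistic function."""
    return 1.0 / (1.0 + np.exp(-x))


def residual_channel_attention(feat, w1, w2):
    """Re-weight channels of a (C, H, W) feature map and add the input back.

    w1 (C//r, C) and w2 (C, C//r) are hypothetical bottleneck weights.
    """
    squeezed = feat.mean(axis=(1, 2))          # global average pool -> (C,)
    hidden = np.maximum(w1 @ squeezed, 0.0)    # ReLU bottleneck -> (C//r,)
    gate = sigmoid(w2 @ hidden)                # per-channel gate in (0, 1)
    return feat + feat * gate[:, None, None]   # residual connection


def residual_spatial_attention(feat):
    """Re-weight spatial positions of a (C, H, W) map and add the input back.

    Assumption: the gate is built directly from the channel-wise mean and
    max maps; a trained model would normally pass them through a learned
    convolution first.
    """
    avg_map = feat.mean(axis=0)                # (H, W)
    max_map = feat.max(axis=0)                 # (H, W)
    gate = sigmoid(avg_map + max_map)          # per-pixel gate in (0, 1)
    return feat + feat * gate[None, :, :]      # residual connection


# Toy usage with random values in place of real encoder features.
rng = np.random.default_rng(0)
C, H, W, r = 8, 16, 16, 4
feat = rng.standard_normal((C, H, W))
w1 = rng.standard_normal((C // r, C)) * 0.1
w2 = rng.standard_normal((C, C // r)) * 0.1

out = residual_spatial_attention(residual_channel_attention(feat, w1, w2))
print(out.shape)  # (8, 16, 16)
```

Because each gate lies in (0, 1) and is added on top of an identity path, the block can only amplify a feature by a factor between 1 and 2. The input signal is never suppressed, which is one plausible reason residual attention of this kind is associated with easier optimization.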
