Abstract:Existing tunnel detection methods include crack and water‐leakage segmentation networks. However, if the automated detection algorithm cannot process all defect cases, manual detection is required to eliminate potential risks. The existing intelligent detection methods lack a universal method that can accurately segment all types of defects, particularly when multiple defects are superimposed. To address this issue, a defect segmentation model is proposed based on Vision Transformer (ViT), which is completely different from the network structure of a convolutional neural network. The model proposes an adapter and a decoding head to improve the training effect of the transformer encoder, allowing it to be fitted to small‐scale datasets. In post‐processing, a method is proposed to quantify the threat level for the defects, with the aim of outputting qualitative results that simulate human observation. The model showed impressive results on a real‐world dataset containing 11,781 defect images collected from a real subway tunnel. The visualizing results proved that this method is effective and has uniform criteria for single, multiple, and comprehensive defects. Moreover, the tests proved that the proposed model has a significant advantage in the case of multiple‐defect superposition, and it achieved 93.77%, 88.36%, and 92.93% for mean accuracy (Acc), mean intersection over union, and mean F1‐score, respectively. With similar training parameters, the Acc of the proposed method is improved by more than 10% over the DeepLabv3+, Mask R‐convolutional neural network, and UPerNet‐R50 models and by more than 5% over the Swin Transformer and ViT‐Adapter. This study implemented a general method that can process all defect cases and output the threat evaluation results, thereby making more intelligent tunnel detection.

A Unet-inspired spatial-attention transformer model for segmenting gear tooth surface defects

Defect-aware transformer network for intelligent visual surface defect detection

ETDNet: Efficient Transformer-Based Detection Network for Surface Defect Detection

Defect detection of gear parts in virtual manufacturing

Steel surface defect detection based on sparse global attention transformer

Defect transformer: An efficient hybrid transformer architecture for surface defect detection

Image segmentation using Vision Transformer for tunnel defect assessment

Wind Turbine Gearbox Gear Surface Defect Detection Based on Multiscale Feature Reconstruction

A Sub-Region Unet for Weak Defects Segmentation with Global Information and Mask-Aware Loss.

A cascaded combination method for defect detection of metal gear end-face

STMS-YOLOv5: A Lightweight Algorithm for Gear Surface Defect Detection

Resformer-Unet: A U-shaped Framework Combining ResNet and Transformer for Segmentation of Strip Steel Surface Defects

Surface defect detection and classification of steel using an efficient Swin Transformer

Automated Detection of Gear Tooth Flank Surface Integrity: A Cascade Detection Approach Using Machine Vision

A Novel Gear Defect Detection Method Through Improved Yolov5 Network Using Attention Mechanism and Feature Fusion

Strategies to Prevent Catheter-Associated Urinary Tract Infection

Transformer-based visual inspection algorithm for surface defects

AnomalySeg: Deep Learning-Based Fast Anomaly Segmentation Approach for Surface Defect Detection

A Real-Time Surface Defect Detection System for Industrial Products with Long-Tailed Distribution

A Dynamic Transformer Network With Early Exit Mechanism for Fast Detection of Multiscale Surface Defects

RDAD: A reconstructive and discriminative anomaly detection model based on transformer