SPACE: SPAtial-aware Consistency rEgularization for anomaly detection in Industrial applications

Daehwan Kim,Hyungmin Kim,Daun Jeong,Sungho Suh,Hansang Cho
2024-11-05
Abstract:In this paper, we propose SPACE, a novel anomaly detection methodology that integrates a Feature Encoder (FE) into the structure of the Student-Teacher method. The proposed method has two key elements: Spatial Consistency regularization Loss (SCL) and Feature converter Module (FM). SCL prevents overfitting in student models by avoiding excessive imitation of the teacher model. Simultaneously, it facilitates the expansion of normal data features by steering clear of abnormal areas generated through data augmentation. This dual functionality ensures a robust boundary between normal and abnormal data. The FM prevents the learning of ambiguous information from the FE. This protects the learned features and enables more effective detection of structural and logical anomalies. Through these elements, SPACE is available to minimize the influence of the FE while integrating various data <a class="link-external link-http" href="http://augmentations.In" rel="external noopener nofollow">this http URL</a> this study, we evaluated the proposed method on the MVTec LOCO, MVTec AD, and VisA datasets. Experimental results, through qualitative evaluation, demonstrate the superiority of detection and efficiency of each module compared to state-of-the-art methods.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper aims to solve the Anomaly Detection (AD) problem in industrial applications, especially when it is impractical to obtain abnormal samples for training. Specifically, the paper proposes a new method - **SPACE (SPAtial - aware Consistency rEgularization)** to address the following challenges: 1. **Limitations of data augmentation**: In industrial AD datasets, the abnormal area usually accounts for only a small part of the entire image, while the rest is normal. Therefore, strong data augmentation will make the normal data closer to the abnormal data instead of maintaining its normal characteristics. This makes existing methods based on strong augmentation difficult to effectively distinguish between normal and abnormal data. 2. **Over - fitting of the student model**: In the Student - Teacher (S - T) framework, the student model may over - imitate the teacher model, leading to over - fitting. This will reduce the ability to detect real anomalies. 3. **Detection of logical anomalies**: In addition to structural anomalies, there are also logical anomalies in industrial applications, that is, a single object may be normal, but the overall image level is abnormal. For example, in the push - nail dataset, there should be one nail in one position, but if there are more than one nail or no nail, it is considered a logical anomaly. To overcome these challenges, SPACE introduces two key components: - **Spatial Consistency regularization Loss (SCL)**: SCL prevents over - fitting by avoiding excessive imitation of the teacher model by the student model, and expands the boundaries of normal data by selectively learning features, thereby ensuring a robust boundary between normal and abnormal data. - **Feature converter Module (FM)**: FM prevents learning of ambiguous information from the feature encoder (FE), protects the learned features, and improves the effectiveness of structural and logical anomaly detection. Through these improvements, SPACE can more effectively detect structural and logical anomalies in industrial applications while using weakly - augmented and strongly - augmented data for training. ### Summary The paper proposes a new anomaly detection method SPACE, which aims to solve the challenges faced by anomaly detection in industrial applications, especially the limitations of data augmentation and the over - fitting problem of the student model. By introducing SCL and FM, SPACE can more effectively detect structural and logical anomalies and has achieved better performance than existing methods on multiple benchmark datasets.