A Recover-then-Discriminate Framework for Robust Anomaly Detection

Peng Xing,Dong Zhang,Jinhui Tang,Zechao li
2024-06-07
Abstract:Anomaly detection (AD) has been extensively studied and applied in a wide range of scenarios in the recent past. However, there are still gaps between achieved and desirable levels of recognition accuracy for making AD for practical applications. In this paper, we start from an insightful analysis of two types of fundamental yet representative failure cases in the baseline model, and reveal reasons that hinder current AD methods from achieving a higher recognition accuracy. Specifically, by Case-1, we found that the main reasons detrimental to current AD methods is that the inputs to the recovery model contain a large number of detailed features to be recovered, which leads to the normal/abnormal area has-not/has been recovered into its original state. By Case-2, we surprisingly found that the abnormal area that cannot be recognized in image-level representations can be easily recognized in the feature-level representation. Based on the above observations, we propose a novel Recover-then-Discriminate (ReDi) framework for AD. ReDi takes a self-generated feature map and a selected prompted image as explicit input information to solve problems in case-1. Concurrently, a feature-level discriminative network is proposed to enhance abnormal differences between the recovered representation and the input representation. Extensive experimental results on two popular yet challenging AD datasets validate that ReDi achieves the new state-of-the-art accuracy.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: Existing Anomaly Detection (AD) methods fail to achieve the ideal recognition accuracy in practical applications. Specifically, the paper analyzes two basic and representative failure cases in existing models and reveals the reasons behind these cases, thus proposing an improvement plan. ### Background and Challenges of the Problem 1. **Case - 1**: In the recovery - based method, the input image contains a large number of detailed features that need to be recovered, resulting in the normal area not being fully recovered to the original state, while the abnormal area is wrongly recovered to the normal state instead. 2. **Case - 2**: The abnormal area that is difficult to identify in the image - level representation is easy to identify in the feature - level representation. This indicates that the existing recovery models cannot ensure extremely low recovery errors in the normal area at the pixel dimension and have problems in boundary alignment, which may lead to false detection of the abnormal area. ### Core Contributions of the Paper To solve the above problems, the author proposes a new framework - the **Recover - then - Discriminate (ReDi)** framework. The main innovation points of this framework include: 1. **Recover Network**: - Using self - generated feature maps and cue images as explicit input information to solve the problem in Case - 1. - Specifically, the HOG feature map and the normal cue image with high similarity are introduced to guide image recovery, ensuring the accurate recovery of the normal area and at the same time generating a large recovery error in the abnormal area. 2. **Discriminate Network**: - A feature - level discriminative network is proposed to enhance the abnormal difference between the recovery representation and the input representation, solving the problem in Case - 2. - The self - correlation loss is introduced to further constrain the consistency between the recovery features of normal samples and the reference features. ### Experimental Results Through extensive experiments on two popular AD datasets (MVTec - AD and KolektorSDD2), it is verified that the ReDi framework has reached the latest state - of - the - art recognition accuracy. ### Summary The main contributions of the paper are: - Proposing a new Recover - then - Discriminate framework, which combines the recovery network and the discriminative network. - Using self - generated feature maps and cue images as input information to solve the "same - shortcut" problem in the recovery process. - Introducing a feature - level discriminative network and self - correlation loss to enhance the accuracy of anomaly detection. Through these innovations, the ReDi framework significantly improves the performance of anomaly detection, especially showing excellent performance in the abnormal segmentation task in complex scenarios.