Abstract:The task of image anomaly detection (IAD) aims to identify deviations from normality in image data. These anomalies are patterns that deviate significantly from what the IAD model has learned from the data during training. However, in real-world scenarios, the criteria for what constitutes normality often change, necessitating the reclassification of previously anomalous instances as normal. To address this challenge, we propose a new scenario termed "normality addition," involving the post-training adjustment of decision boundaries to incorporate new normalities. To address this challenge, we propose a method called Normality Addition via Normality Detection (NAND), leveraging a vision-language model. NAND performs normality detection which detect patterns related to the intended normality within images based on textual descriptions. We then modify the results of a pre-trained IAD model to implement this normality addition. Using the benchmark dataset in IAD, MVTec AD, we establish an evaluation protocol for the normality addition task and empirically demonstrate the effectiveness of the NAND method.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is in industrial image anomaly detection models, how to adjust the decision boundary of the model by adding new normal patterns (normality) without retraining or fine - tuning the model. Specifically, when the operating conditions or quality control standards change, some situations that were previously considered abnormal may need to be reclassified as normal. For example, in the manufacturing process, some features that were originally regarded as defects (such as thread ends in carpets) may be considered normal due to process changes. Therefore, the paper proposes a method named "Normality Addition via Normality Detection (NAND)" to adapt to such changes.
### Problem Background
In practical applications, industrial image anomaly detection models are usually trained only with normal data during training, so that they can identify abnormal situations that are significantly different from normal data. However, with the change of the production environment or the update of quality control standards, some situations that were previously considered abnormal may need to be redefined as normal. The traditional solution is to collect new data and retrain or fine - tune the model, but this is both time - consuming and resource - intensive. Especially in industries where abnormal situations are relatively rare, it is particularly difficult to obtain sufficient training data.
### Proposed Solution
To meet this challenge, the paper proposes a new scenario named "Normality Addition" and introduces a method based on the Vision - Language Model (VLM) - NAND. The main contributions of NAND include:
1. **Proposing the "Normality Addition" problem**: Defining a new task, that is, adjusting the decision boundary of the model by adding new normal patterns without retraining the model.
2. **Introducing the NAND method**: Using VLM to generate text prompts describing the newly added normal patterns, and modifying the output of the existing anomaly detection model through these prompts, so that the model can recognize the newly added normal patterns.
3. **Empirical verification**: Evaluating the NAND method using the MVTec AD benchmark dataset and demonstrating its effectiveness.
### Working Principle of NAND
The specific workflow of NAND is as follows:
1. **Prompt Generator**: According to the given text description (such as "thread"), generate a series of text prompts describing the newly added normal patterns. These prompts are converted into feature vectors by the text encoder of VLM.
2. **Normality Detection Module**: Using methods such as APRIL - GAN to detect the regions related to the newly added normal patterns in the query image and generate a suppression map. This suppression map is used to suppress the anomaly scores of the original anomaly detection model in these regions.
3. **Final Anomaly Map Generation**: Perform element - wise multiplication between the suppression map and the original anomaly map to generate the final anomaly map. This can ensure that the newly added normal patterns will not be misjudged as abnormal, while other types of anomalies can still be correctly identified.
### Experimental Results
The experimental results show that the NAND method can effectively improve the performance of the anomaly detection model, especially when dealing with newly added normal patterns. Through the test on the MVTec AD dataset, NAND shows significant performance improvement on multiple anomaly detection models.
In conclusion, the NAND method proposed in this paper provides a flexible and efficient solution for industrial image anomaly detection, which can quickly adapt to new normal patterns without retraining the model. This has important significance in practical applications, especially in those industries that need to frequently adjust quality control standards.