Abstract: One of the major challenges in anomaly detection (AD) is learning a single unified, generalizable model that adapts to multi-class and especially cross-class settings, in which the model is trained on normal samples from seen classes with the objective of detecting anomalies in both seen and unseen classes. In this work, we propose a novel Proposal Masked Anomaly Detection (PMAD) approach for this challenging multi- and cross-class anomaly detection task. PMAD adapts to seen and unseen classes through two key designs: MAE-based patch-level reconstruction and prototype-guided proposal masking. First, motivated by MAE (Masked AutoEncoder), we develop a patch-level reconstruction model rather than the image-level reconstruction adopted in most AD methods, for the following reason: masked patches in unseen classes can be reconstructed well from the visible patches using the adaptive reconstruction capability of MAE. Moreover, we improve MAE with a ViT encoder-decoder architecture, combinational masking, and visual tokens as reconstruction objectives to make it more suitable for anomaly detection. Second, we develop a two-stage anomaly detection procedure for inference. In the proposal masking stage, the prototype-guided proposal masking module generates proposals that cover as many suspicious anomalies as possible, and masked patches are then generated from the proposal regions. By masking the most likely anomalous patches, the “shortcut reconstruction” issue (i.e., anomalous regions also being reconstructed well) can be mostly avoided. In the reconstruction stage, these masked patches are reconstructed by the trained patch-level reconstruction model to determine whether they are anomalous. Extensive experiments show that the proposed PMAD significantly outperforms current state-of-the-art models under the multi-class and especially the cross-class settings. Code will be publicly available at https://github.com/xcyao00/PMAD.
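
The two-stage inference described above can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say how patches are compared with prototypes or how many patches are masked, so the Euclidean distance, the top-k selection, and every function name below are assumptions made for illustration. In particular, `toy_reconstruct` is a hypothetical stand-in (mean of the visible patches) for the trained MAE-based patch-level reconstruction model.

```python
import math


def nearest_prototype_distance(feature, prototypes):
    """Distance from one patch feature to its closest normal prototype (assumed Euclidean)."""
    return min(math.dist(feature, p) for p in prototypes)


def propose_masked_patches(patch_features, prototypes, num_masked):
    """Stage 1 (assumed form): mask the patches that lie farthest from all prototypes."""
    distances = [nearest_prototype_distance(f, prototypes) for f in patch_features]
    ranked = sorted(range(len(patch_features)), key=lambda i: distances[i], reverse=True)
    return sorted(ranked[:num_masked])


def toy_reconstruct(patch_features, masked_indices):
    """Hypothetical stand-in for the trained model: predict each masked patch
    as the mean of the visible patches, never looking at the masked content."""
    masked = set(masked_indices)
    visible = [f for i, f in enumerate(patch_features) if i not in masked]
    if not visible:
        raise ValueError("at least one patch must stay visible")
    dim = len(patch_features[0])
    mean = [sum(f[d] for f in visible) / len(visible) for d in range(dim)]
    return {i: mean for i in masked_indices}


def anomaly_scores(patch_features, prototypes, num_masked):
    """Stage 2: score each masked patch by its reconstruction error; unmasked patches get 0."""
    masked_indices = propose_masked_patches(patch_features, prototypes, num_masked)
    reconstructed = toy_reconstruct(patch_features, masked_indices)
    scores = [0.0] * len(patch_features)
    for i in masked_indices:
        scores[i] = math.dist(patch_features[i], reconstructed[i])
    return scores


# Toy example: three patches near the normal prototype and one outlier (index 2).
patches = [[1.0, 1.0], [1.1, 0.9], [5.0, 5.0], [0.9, 1.0]]
prototypes = [[1.0, 1.0]]
scores = anomaly_scores(patches, prototypes, num_masked=1)
```

Because the suspicious patch is hidden before reconstruction, the stand-in model cannot copy it from the input, which is the intuition behind avoiding "shortcut reconstruction": the outlier receives a large reconstruction error rather than being reproduced faithfully.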

MSTAD: A masked subspace-like transformer for multi-class anomaly detection

A Diffusion-Based Framework for Multi-Class Anomaly Detection

Masked Swin Transformer Unet for Industrial Anomaly Detection

A Novel MAE-Based Self-Supervised Anomaly Detection and Localization Method

Self-Supervised Masked Convolutional Transformer Block for Anomaly Detection

Masked feature regeneration based asymmetric student–teacher network for anomaly detection

UTRAD: Anomaly detection and localization with U-Transformer

Exploring Plain ViT Reconstruction for Multi-class Unsupervised Anomaly Detection

A Unified Model for Multi-class Anomaly Detection

Masked Transformer for image Anomaly Localization

An industrial product surface anomaly detection method based on masked image modeling

Hierarchical Vector Quantized Transformer for Multi-class Unsupervised Anomaly Detection

DMAD: Dual Memory Bank for Real-World Anomaly Detection

MoEAD: A Parameter-efficient Model for Multi-class Anomaly Detection

One-for-All: Proposal Masked Cross-Class Anomaly Detection

BTAD: A binary transformer deep neural network model for anomaly detection in multivariate time series data

MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection

AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios

RDAD: A reconstructive and discriminative anomaly detection model based on transformer