Abstract:With the development of deep learning, the unsupervised visual anomaly detection and localization task has gained significant attention in both academia and industry, where only normal data are used for training. Existing methods for this task typically train using all training data simultaneously. However, in practical industrial scenarios, new product classes are usually introduced incrementally, leading to the sequential availability of training data. Such scenarios demand methods for class-incremental anomaly detection and localization (CADL). The main challenge of class-incremental learning is to retain knowledge of old classes when learning new classes. In this article, we aim to effectively leverage limited exemplars of old classes to retain knowledge for the CADL task. Achieving this goal requires a model that can efficiently capture anomaly-identification-related knowledge from limited exemplars. Considering that pixel-level anomaly identification requires an understanding of the surrounding context, we treat context within inputs as valuable anomaly-identification-related knowledge and design a context-aware feature reconstruction (CFR) model to capture such knowledge. Moreover, to avoid inter-class context conflict that may arise with class increments, we design an intermediate feature organization strategy. This strategy and output-level knowledge distillation jointly form dual constraints to regularize the model at both mid-feature and output levels. Utilizing the CFR model with Dual Constraints, the proposed CFRDC can effectively retain old-class knowledge while learning new classes, thus addressing the CADL task. Experimental results on the commonly-used MVTec-AD dataset demonstrate the effectiveness and outstanding performance of the proposed method in CADL.

DiffCAD: Weakly-Supervised Probabilistic CAD Model Retrieval and Alignment from an RGB Image

DiffCAD: Weakly-Supervised Probabilistic CAD Model Retrieval and Alignment from an RGB Image

Weakly-Supervised End-to-End CAD Retrieval to Scan Objects

Scan2CAD: Learning CAD Model Alignment in RGB-D Scans

Mask2CAD: 3D Shape Prediction by Learning to Segment and Retrieve

FastCAD: Real-Time CAD Retrieval and Alignment from Scans and Videos

Point2CAD: Reverse Engineering CAD Models from 3D Point Clouds

Zero-Shot 3d Pose Estimation of Unseen Object by Two-Step Rgb-D Fusion

Img2CAD: Conditioned 3D CAD Model Generation from Single Image with Structured Visual Geometry

CAD-Estate: Large-scale CAD Model Annotation in RGB Videos

CAD-Deform: Deformable Fitting of CAD Models to 3D Scans

Context-aware Feature Reconstruction for Class-Incremental Anomaly Detection and Localization

SPARC: Sparse Render-and-Compare for CAD model alignment in a single RGB image

Robust 3D Reconstruction with an RGB-D Camera

Img2CAD: Reverse Engineering 3D CAD Models from Images through VLM-Assisted Conditional Factorization

PS-CAD: Local Geometry Guidance via Prompting and Selection for CAD Reconstruction

GenCAD: Image-Conditioned Computer-Aided Design Generation with Transformer-Based Contrastive Representation and Diffusion Priors

NeurCADRecon: Neural Representation for Reconstructing CAD Surfaces by Enforcing Zero Gaussian Curvature

Learning Deep Object Detectors from 3D Models

SECAD-Net: Self-Supervised CAD Reconstruction by Learning Sketch-Extrude Operations

Model2Scene: Learning 3D Scene Representation via Contrastive Language-CAD Models Pre-training