DDPM-MoCo: Advancing Industrial Surface Defect Generation and Detection with Generative and Contrastive Learning

Yangfan He,Xinyan Wang,Tianyu Shi
2024-05-10
Abstract:The task of industrial detection based on deep learning often involves solving two problems: (1) obtaining sufficient and effective data samples, (2) and using efficient and convenient model training methods. In this paper, we introduce a novel defect-generation method, named DDPM-MoCo, to address these issues. Firstly, we utilize the Denoising Diffusion Probabilistic Model (DDPM) to generate high-quality defect data samples, overcoming the problem of insufficient sample data for model learning. Furthermore, we utilize the unsupervised learning Momentum Contrast model (MoCo) with an enhanced batch contrastive loss function for training the model on unlabeled data, addressing the efficiency and consistency challenges in large-scale negative sample encoding during diffusion model training. The experimental results showcase an enhanced visual detection method for identifying defects on metal surfaces, covering the entire process, starting from generating unlabeled sample data for training the diffusion model, to utilizing the same labeled sample data for downstream detection tasks. This study offers valuable practical insights and application potential for visual detection in the metal processing industry.
Computer Vision and Pattern Recognition,Artificial Intelligence
What problem does this paper attempt to address?
The paper aims to address two main issues in industrial product defect detection: obtaining sufficient effective sample data and using efficient and convenient model training methods. Specifically, the authors propose a new defect generation method, DDPM-MoCo, to solve these problems in the following ways: 1. **Insufficient Sample Data**: Utilizing the Denoising Diffusion Probabilistic Model (DDPM) to generate high-quality defect samples, addressing the issue of insufficient sample data during model training. 2. **Training Efficiency and Consistency**: Using the unsupervised learning Momentum Contrast model (MoCo) and its enhanced batch contrastive loss function to train the model on unlabeled data, solving the efficiency and consistency challenges of large-scale negative sample encoding during diffusion model training. Experimental results show that this method can improve the visual detection of metal surface defects, covering the entire process from generating unlabeled sample data to using the same labeled sample data for downstream detection tasks. The research provides valuable practical insights and application potential for visual detection in the metal processing industry.