Controllable Image Synthesis of Industrial Data Using Stable Diffusion

Gabriele Valvano, Antonino Agostino, Giovanni De Magistris, Antonino Graziano, Giacomo Veneri
2024-01-06
Abstract: Training supervised deep neural networks that perform defect detection and segmentation requires large-scale fully-annotated datasets, which can be hard or even impossible to obtain in industrial environments. Generative AI offers opportunities to enlarge small industrial datasets artificially, thus enabling the usage of state-of-the-art supervised approaches in the industry. Unfortunately, even good generative models need a lot of data to train, while industrial datasets are often tiny. Here, we propose a new approach for reusing general-purpose pre-trained generative models on industrial data, ultimately allowing the generation of self-labelled defective images. First, we let the model learn the new concept, entailing the novel data distribution. Then, we force it to learn to condition the generative process, producing industrial images that satisfy well-defined topological characteristics and show defects with a given geometry and location. To highlight the advantage of our approach, we use the synthetic dataset to optimise a crack segmentor for a real industrial use case. When the available data is small, we observe a considerable performance increase under several metrics, showing the method's potential in production environments.
Computer Vision and Pattern Recognition, Machine Learning
What problem does this paper attempt to address?
This paper addresses the shortage of fully annotated training data for supervised deep neural networks that perform defect detection and segmentation in industrial environments. It proposes a method for reusing general-purpose pre-trained generative models on small industrial datasets to generate self-labelled defective images. The method has two main steps:

1. **Learning New Concepts**: The model first learns the new concept, i.e., the new data distribution.
2. **Learning Conditional Constraints**: The model then learns to condition the generative process on label-driven constraints, producing industrial images that satisfy well-defined topological characteristics and show defects with a given geometry and location.

When the available real data is small, training on the resulting synthetic dataset yields a considerable performance increase under several metrics. Experimental results show that the method improves a crack segmentation model in a real industrial use case and demonstrate its potential in industrial burner instance segmentation.
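
The "self-labelled" idea can be illustrated with a toy sketch: if the generator is conditioned on a defect mask, that same mask can serve as the segmentation label for the generated image. The code below is only an illustration under stated assumptions, not the authors' implementation. The function names (`random_crack_mask`, `placeholder_generator`, `make_self_labelled_pair`) are hypothetical, the random-walk crack shape is an arbitrary choice, and `placeholder_generator` is a trivial stand-in for a conditioned diffusion model.

```python
import random


def random_crack_mask(height, width, start, steps, rng):
    """Draw a thin crack-like path as a binary mask (hypothetical helper).

    The path starts at `start` (row, col), moves one pixel to the right per
    step, and drifts up or down by at most one pixel.
    """
    mask = [[0] * width for _ in range(height)]
    row, col = start
    for _ in range(steps):
        if not (0 <= row < height and 0 <= col < width):
            break  # stop if the path leaves the image
        mask[row][col] = 1
        row += rng.choice((-1, 0, 1))
        col += 1
    return mask


def placeholder_generator(mask):
    """Stand-in for a mask-conditioned generative model (an assumption).

    It paints dark pixels where the mask is set and a light background
    elsewhere. A real pipeline would call a fine-tuned diffusion model here.
    """
    return [[0.1 if px else 0.8 for px in row] for row in mask]


def make_self_labelled_pair(mask, generate_fn):
    """Return (image, label), where the conditioning mask doubles as the label."""
    image = generate_fn(mask)
    label = [row[:] for row in mask]  # copy so later edits do not alias
    return image, label


rng = random.Random(0)
mask = random_crack_mask(64, 64, start=(32, 5), steps=40, rng=rng)
image, label = make_self_labelled_pair(mask, placeholder_generator)
```

Because the label is derived from the conditioning input rather than from manual annotation, each generated image arrives with its segmentation target already attached. This is what would let a synthetic dataset be used directly to train a segmentor.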