Seal2Real: Prompt Prior Learning on Diffusion Model for Unsupervised Document Seal Data Generation and Realisation

Jiancheng Huang, Yifan Liu, Yi Huang, Shifeng Chen
2023-10-01
Abstract: In document processing, seal-related tasks such as seal segmentation, seal authenticity discrimination, seal removal, and text recognition under seals have broad commercial applications. However, these tasks depend heavily on labelled document seal datasets, which are scarce, so little work has been done on them. To address this lack of labelled data, we propose Seal2Real, a generative method that produces a large amount of labelled document seal data, and construct the Seal-DB dataset containing 20K labelled images. In Seal2Real, we propose a prompt prior learning architecture based on a pre-trained Stable Diffusion Model that migrates the prior generative power of the pre-trained model to our seal generation task with unsupervised training. The realistic seal generation capability greatly improves the performance of downstream seal-related tasks on real data. Experimental results on the Seal-DB dataset demonstrate the effectiveness of Seal2Real.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses the shortage of training data for seal-related tasks in document processing, such as seal segmentation, seal authenticity verification, seal removal, and text recognition under seals. These tasks rely on labeled document seal datasets, which are currently very limited. To fill this gap, the authors propose Seal2Real, a generative method that produces a large amount of labeled document seal data, and use it to construct the Seal-DB dataset of 20,000 images. Seal2Real builds on a pretrained Stable Diffusion Model and introduces a prompt prior learning architecture that transfers the model's generative capability to seal generation through unsupervised training. This improves the realism of the generated seals, which in turn improves the performance of downstream seal-related tasks on real data. Experimental results on the Seal-DB dataset support the effectiveness of Seal2Real for seal generation and realisation. The paper also emphasizes the role of the seal forgery network, which makes the generated seals more realistic, raises the quality of the dataset, and benefits the training of downstream tasks. User studies and evaluations on downstream tasks, such as seal segmentation, authenticity verification, and text recognition, further support the advantages of Seal2Real. The paper notes one limitation: the small size of the available real datasets may lead to overfitting and bias. Future work will focus on increasing data diversity.