Graph-based Deep Generative Modelling for Document Layout Generation

Sanket Biswas,Pau Riba,Josep Lladós,Umapada Pal
DOI: https://doi.org/10.48550/arXiv.2107.04357
2021-07-09
Computer Vision and Pattern Recognition
Abstract:One of the major prerequisites for any deep learning approach is the availability of large-scale training data. When dealing with scanned document images in real world scenarios, the principal information of its content is stored in the layout itself. In this work, we have proposed an automated deep generative model using Graph Neural Networks (GNNs) to generate synthetic data with highly variable and plausible document layouts that can be used to train document interpretation systems, in this case, specially in digital mailroom applications. It is also the first graph-based approach for document layout generation task experimented on administrative document images, in this case, invoices.
What problem does this paper attempt to address?