DensityLayout: Density-Conditioned Layout GAN for Visual-Textual Presentation Designs.

HsiaoYuan Hsu,Xiangteng He,Yuxin Peng
DOI: https://doi.org/10.1007/978-3-031-46308-2_16
2023-01-01
Abstract:Generating layouts for visual-textual presentation designs aims at arranging elements such as logo, text, and underlay on the given images, which is the key to automating poster designs. It is challenging since the compositions of images, the spatial patterns of layout elements, and their cross-relationships need to be simultaneously considered. Existing works focus on the cross-relationships and either (1) suffer from the instability of adopting off-the-shelf saliency maps as prior knowledge or (2) require the semantic content of each element. To this end, this paper presents an efficient density paradigm that requires neither off-the-shelf models nor additional training data other than image-layout pairs. Under this paradigm, a three-stage approach using GAN is proposed, entitled DensityLayout . First, a density mapping network weakly supervised by the custom consistency loss will translate given images to spatial distributions of elements. Second, a multi-scale strategy is proposed to enhance understanding of the maps, and a generator conditioned on these visual features will generate preliminary layouts. Finally, a directed graph representation illustrating the inclusion relationships between elements is presented, and a graph convolution network will fine-tune the layouts. The effectiveness of the proposed approach is validated on CGL-Dataset, showing it achieves the best performance by generating visually appealing layouts for visual-textual presentation designs of diverse images.
What problem does this paper attempt to address?