A Deep Learning Network Based End-to-end Image Composition.

Xiaoyu Zhu,Haodi Wang,Zhiyi Zhang,Xiuping Wu,Junqi Guo,Hao Wu
DOI: https://doi.org/10.1016/j.image.2021.116570
IF: 3.453
2022-01-01
Signal Processing Image Communication
Abstract:Currently, high-quality image composition largely depends on multiple user interactions and complex manual operations. In particular, the process of composition object extraction and region determination has become a burden that cannot be underestimated, restricting wider applications. Aiming at this problem, we propose an end-to-end image composition method that combines powerful deep-learning-based application modules such as image retrieval and instance segmentation to realize efficient non-interactive image composition. Specifically, the retrieval module, which is based on the attention mechanism, can determine semantically similar material images. Moreover, the content of interest (COI) extraction and optimization procedure is able to select the most proper instance among the material images. Finally, we propose the double-sieving strategy, which locates the best composition position in the target image. Using these effective modules, we carried out niche targeting experiments using an image database with high plausibility. The realistic experimental results illustrate that our method can achieve effective and reasonable end-to-end image composition.
What problem does this paper attempt to address?