High-Quality Pluralistic Image Completion Via Code Shared VQGAN.

Chuanxia Zheng, Guoxian Song, Tat-Jen Cham, Jianfei Cai, Dinh Phung, Linjie Luo
DOI: https://doi.org/10.48550/arxiv.2204.01931
2022-01-01
Abstract: ...the richness of the representation further facilitate the subsequent deployment of a transformer to effectively learn how to composite and complete a masked image at the discrete code domain. Based on the global context well-captured by the transformer and the available visual regions, we are able to sample all tokens simultaneously, which is completely different from the prevailing autoregressive approach of iGPT-based works, and leads to more than 100× faster inference speed. Experiments show that our framework is able to learn semantically-rich discrete codes efficiently and robustly, resulting in much better image reconstruction quality. Our diverse image completion framework significantly outperforms the state-of-the-art both quantitatively and qualitatively on multiple benchmark datasets.